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ABSTRACT 
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IBM/FSD 

Interactive  Digital  Image  Processing  Investigation 
December  1979  (Unclassified) 


The  objective  of  this  investigation  was  to  develop  an  interactive 
software  package  of  general  utility  that  can  be  used  to  support 
experiments  and  evaluate  a  class  of  interactive/automatic  digital 
techniques  whose  goal  is  feature  extraction/exploitation.  The 
initial  task  was  to  survey  and  select  techniques,  algorithms  and 
supporting  functions  that  were  to  be  included  in  the  system.  A 
maximum  likelihood  supervised  classification  algorithm,  and  an 
unsupervised  classification  (clustering)  method, based  eo—tha — 
ISODATA  algorithm  were -««leeted .  These  and  the  associated  support 
and  evaluation  functions  were  developed  and  implemented  on  the  ETtr^ 
Digital  Image  Analysis  Laboratory  ('DfAIr)rf  Testing  of  the  system 
was  accomplished  by  performing  experiments  on  areas  for  which  both 
multi-dimensional  images  and  ground  truth  were  available.  Analysis 
of  experimental  results  validated  the  system.  These  experimental 
results  were  in  good  agreement  with  ground  truth  and  further,  super¬ 
vised  and  unsupervised  classification  results  were  in  substantial 
agreement.  This  report  describes  the  data  processing  algorithms/ 
techniques,  software  system,  user  guide  information,  and  the  experi¬ 
ment  procedures  used  along  with  their  results. 
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PREFACE 


This  is  the  final  report  for  the  Interactive  Digital  Image  Processing 
Investigation  Department  of  the  Army  contract  number  DAAK  70-77-C-0166. 
The  purpose  of  the  investigation  was  to  survey  algorithms  for  ex¬ 
tracting  features  from  multi-dimensional  digital  imagery;  to  select 
the  most  promising  algorithms;  and  to  implement  them  on  the  DIAL 
system,  where  algorithm  performance  was  calculated. 

The  report  was  prepared  by  W.  C.  Rice,  R.  J.  Spieler,  and  J.  S.  Shipman. 
We  would  like  to  thank  the  following  people  at.  ETL  for  their  help 
during  the  course  of  this  work: 

Samuel  Barr 
Dave  Jenkins 
Bob  Matos 

John  Moses  (DBA  Systems,  Inc.) 

Dr.  Bryce  Schrock 
James  Stilwell 
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Section  1 
INTRODUCTION 


1.1  BACKGROUND 

The  Digital  Image  Analysis  Laboratory  (DIAL)  at  the  Engineer  Topo¬ 
graphic  Laboratories  is  one  of  the  most  advanced  interactive  digital 
image  processing  facilities  in  the  country.  The  computer  and  display 
subsystems  in  combination  with  the  large  repertory  of  applications 
programs  enable  the  user  to  apply  interactively  a  variety  of  digital 
image  processing  techniques  to  problems  in  photointerpretation,  stereo 
compilation,  and  automatic  cartography.  A  complete  range  of  display 
operations  and  many  digital  image  processing  techniques  such  as  image 
rotation  and  warping  are  available  as  callable  routines.  Several  fea¬ 
ture  extraction  algorithms  have  been  implemented  to  take  advantage  of 
the  throughput  capabilities  of  the  STARAN  associative  processor.  These 
capabilities  include  differentiation,  mask  comparison  (which  yields 
information  about  the  direction  of  edges),  and  convolution.  However, 
up  until  the  time  the  effort  described  in  this  report  was  initiated, 
none  of  the  feature  extraction  algorithms  typified  by  the  maximum 
likelihood  classification  method  used  in  NASA's  Large  Area  Crop  Identi¬ 
fication  Experiment  (LACIE)  was  available  to  DIAL  users.  It  was  the 
purpose  of  this  effort  to  select  several  proven  feature  extraction 
algorithms  of  this  type  from  those  which  have  been  documented,  imple¬ 
ment  those  chosen  as  DIAL  callable  program  modules  (PMs) ,  and  test  and 
validate  the  PMs  with  multi-channel  digital  images  commonly  processed 
by  these  algorithms. 


Two  algorithms  were  selected  for  implementation,  a  supervised  class¬ 
ification  scheme  based  on  the  maximum  likelihood  approach  to  pattern 
recognition,  and  an  unsupervised  clustering  scheme  based  on  the  ISODATA 
algorithm  developed  by  Ball  and  Hall  at  Stanford  Research  Institute. 
Supervised  classification  requires  some  apriori  knowledge  of  the  image 
data  to  be  classified,  known  as  "ground  truth"  in  the  LACIE  applica¬ 
tion,  while  unsupervised  classification  does  not.  On  the  other  hand, 
supervised  classification  is  usually  computationally  faster  and  more 
accurate  than  unsupervised  classification.  The  algorithms  chosen  are  , 
therefore,  complementary.  In  addition,  they  have  been  successfully 
applied  to  several  types  of  digital  imagery,  not  just  Landsat  agri¬ 
cultural  scenes,  and  have  been  implemented  in  a  number  of  digital 
image  processing  facilities  of  varying  hardware  configurations.  The 
two  algorithms  are  then  natural  choices  for  the  DIAL  application. 

The  classification  techniques  require  as  supporting  capabilities  the 
building  of  composite  (multi-channel)  images  from  single  channel 
source  images,  the  definition  of  fields  in  the  image  to  be  used  as 
training  fields,  classes,  and  fields  to  be  classified,  the  computation 
of  class  statistics  (signatures),  and  the  display  of  class  maps  and 
the  generation  of  classification  reports.  These  capabilities  are  im¬ 
plemented  in  five  PMs  callable  by  DIAL  users,  INTERL,  FIELDEF, 

CLASTAT,  MAXLIK,  and  CLUSTER  which  communicate  with  each  other 
through  disk  files.  The  PMs  were  tested  and  the  algorithms  validated 
on  two  sets  of  image  data  and  are  now  available  to  DIAL  users  for 
application  to  problems  in  feature  extraction. 
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1.2  REPORT  ORGANIZATION 


Section  1  contains  background  information  and  the  report  organization. 

Section  2  contains  a  review  of  the  feature  extraction  package,  which 
consists  of  the  five  PMs  INTERL,  FIELDEF,  CLASTAT,  MAXLIK,  and  CLUSTER, 
directed  toward  the  user. 

Section  3  includes  a  detailed  user's  guide  to  the  package,  including 
copies  of  the  menus  encountered  by  the  user,  and  a  documentation  of 
the  PMs  which  make  up  the  package.  The  documentation  is  in  high-level 
program  design  language  (PDL)  for  the  major  programs,  with  conven¬ 
tional  descriptions  of  the  principal  subroutines  called.  Copies  of 
complete  commented  computer  listings  have  been  deposited  with 
J.  E.  Stilwell  of  USAETL. 

Section  4  discusses  the  results  of  the  experimentation.  The  data  sets 
are  described,  the  principal  steps  in  the  classification  procedure 
explained,  and  the  classification  results  interpreted.  Copies  of  the 
displayed  reports  and  photographs  of  a  set  of  the  class  maps  are 
included. 

Section  5  presents  the  conclusions  of  the  investigation,  and  recom¬ 
mendations  for  further  work  to  extend  and  enhance  the  DIAL  feature 
extraction  capability  provided  by  the  present  package. 


Section  2 

FEATURE  EXTRACTION  OVERVIEW 


This  section  presents  an  overview  of  the  Feature  Extraction  capability 
of  DIAL.  In  the  overview  are  discussed  the  overall  structure  and 
general  user  information.  It  is  assumed  that  the  reader  has  some 
familiarity  with  DIAL. 

2.1  OVERALL  STRUCTURE 

The  feature  extraction  package  described  in  this  report,  which  adds  to 
the  DIAL  system  the  capability  of  performing  both  supervised  and  un¬ 
supervised  classification,  consists  of  five  interrelated  processing 
modules  (PMs) .  The  PMs  communicate  thru  the  use  of  image  and  param¬ 
eter  data  sets.  The  five  PMs  are 

INTERL  -  Builds  a  composite  image  of  up  to  24  channels  in  the 
band-interleaved-by-pixel  format  from  images  resident 
on  disk,  and  stores  the  resulting  image  on  disk. 

FIELDEF  -  Constructs  a  parameter  data  set  (field/ class)  of  fields 
associated  with  a  designated  composite  image.  The 
fields  can  be  closed  polygons  or  connected  linear  seg¬ 
ments  of  up  to  twelve  sides  or  segments.  FIELDEF  in¬ 
cludes  facilities  for  displaying  a  composite  image; 
designating  a  portion  of  that  image  for  magnification; 
and  displaying  the  fields  defined  superimposed  on  the 
composite  image. 
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CLASTAT  -  Computes  the  channel  means  and  covariance  matrix  for 
previously  defined  fields  to  form  classes,  as  well  as 
the  Bhattacharyya  distance  between  classes  (  a  measure 
of  their  separation) . 

CLUSTER  -  Performs  unsupervised  classification  (clustering)  of  a 
previously  defined  composite  image  according  to  the 
ISODATA  algorithm.  Included  are  facilities  for  display¬ 
ing  a  channel  of  the  image  to  be  clustered,  and  the 
resulting  cluster  map. 

MAXLIK  -  Performs  supervised  classification  according  to  the 
maximum  likelihood  or  Bayes*  algorithms. 

The  interfaces  and  data  sets  used  by  the  PMs  are  shown  in  Figure  2.1-1. 
To  take  advantage  of  the  similarity  of  functions  the  PMs  perform,  a 
significant  set  of  common  subroutines  was  developed  and  is  used  by  all 


2.2  GENERAL  USER  INFORMATION 

A  typical  classification  sequence  is  given  in  Figure  2.2-1.  Cluster¬ 
ing  and  maximum  likelihood  classification  are  done  on  multichannel 
(composite)  images,  so  that  the  first  step  is  the  construction  of  an 
appropriate  composite  image  consisting  of  the  channels  to  be  used. 

For  example,  if  it  were  decided  that  bands  5,  6,  and  7  of  a  four-band 
LANDSAT  MSS  image  were  to  be  classified,  a  three  channel  composite 
image  would  be  built  using  the  INTERL  PM.  Once  the  composite  image 
is  made  it  can  be  used  until  it  is  purged  and  does  not  need  to  be 
recreated  before  each  classification  run. 
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INTERL 


FIELDEF 


CLASTAT 


CLUSTER 


Next  comes  field  definition  (FIELDEF).  The  fields  defined  there  serve 
two  purposes.  Some  will  be  used  to  define  the  area  to  be  classified 
or  clustered,  while  others  will  become  classes  (training  fields)  for 
which  statistics  will  be  computed  in  CLASTAT.  Further,  both  the 
MAXLIK  and  CLUSTER  PMs  perform  on  the  minimum  rectangle  encompassing 
all  the  fields  selected. 

If  supervised  classification  (MAXLIK)  is  to  be  performed  CLASTAT  would 
be  called  next  to  compute  class  signatures  (the  use  of  classes  is 
optional  in  unsupervised  classification) .  Finally  MAXLIK  or  CLUSTER 
would  be  initiated.  In  many  applications  it  is  advantageous  to  classify 
a  given  image  by  both  supervised  and  unsupervised  algorithms.  In  such 
cases  MAXLIK  and  CLUSTER  would  be  called  one  after  the  other  and  both 
results  would  be  analyzed.  This  process  in  many  cases  leads  to  further 
runs. 

In  general,  each  PM  is  self-explanatory  through  the  menu  sequence, 
which  requests  the  user  to  make  certain  choices  and  to  enter  certain 
parameters.  If  a  user  has  some  knowledge  of  the  principles  of  pattern 
recognition  he  should  have  no  difficulty  using  this  package  for  his 
application. 


Section  3 

PROCESSING  MODULE  DESCRIPTION 


3.1  INTERL  PROCESSING  MODULE 

INTERL  is  the  processing  module  by  which  single  channel  DIAL  images  are 
composed  into  multi-channel,  band- inter leaved-by-pixel,  DIAL  images. 

The  image/scene  that  results  from  the  process  will  be  called  a  com¬ 
posite  image;  this  definition  is  used  throughout  the  report.  Composite 
images  can  have  up  to  24  channels,  and  all  the  channels  are  assumed  to 
be  registered. 


3.1.1  INTERL  User  Information 

Upon  entering  the  INTERL  processing  module  (PM)  the  user  is  asked  if  a 
new  composite  image  is  to  be  created  or  if  channels  are  to  be  added  to 
an  existing  composite  image.  If  a  new  composite  image  is  to  be  created, 
then  requests  are  made  to 

o  name  the  image 

o  enter  an  image  specification,  and 

o  enter  the  pixel  size  (number  of  bits  per  pixel)  which 
can  be  different  from  that  of  the  input  images. 

When  using  an  existing  composite  image,  the  user  enters  the  image  name 
or  a  carriage  return  and  selects  from  the  library.  Upon  completing  the 
above  the  menu  given  in  Figure  3.1-1  is  displayed  and  the  processing  is 
controlled  by  the  option  selected.  This  menu  is  redisplayed  upon  com¬ 
pletion  of  processing  for  the  selected  option.  The  exception  is  option 
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Figure  3.1-1  INTERL  Options  Menu 


six  which  terminates  processing. 


1.  List  of  Composite  Image  Channels 

This  option  displays  on  the  Tektronix  a  list  of  all  the 
channels  associated  with  the  composite  image.  "Associated" 
means  that  if  the  next  option  selected  is  Build  Composite 
Image,  then  the  composite  would  have  the  channels  displayed. 
The  reason  "associated"  is  used  is  that  it  is  possible  to 
delete  or  add  images  to  this  composite  before  a  build  is 
performed.  See  Figure  3.1-2  for  an  example  of  the  display 
this  option  generates.  It  states  that  composite  image 
ERIMSYN  has  6  channels;  the  names  and  sequence  are  as  given 
in  the  figure. 

2.  List  of  Channel  Image  Correspondence 

Like  Option  1,  this  option  gives  the  user  information  about 
the  composite  image  to  be  built.  It  differs  in  that  a  user 
may  add  composite  images  to  the  composite  being  created. 
Under  this  situation,  the  user  needs  to  know  the  channel/ 
composite  image  correspondence  because  when  composite  images 
are  added  or  deleted  all  channels  of  the  composite  are  in¬ 
cluded  in  the  operation.  Figure  3.1-3  gives  an  example  of 
the  display. 

3.  Add  Image  to  Composite 

This  option  just  requests  an  image  name  to  be  added  to  the 
composite.  This  can  be  accomplished  by  either  typing  in  a 
name  or  hitting  carriage  control  (CR)  and  selecting  from  the 
library.  A  maximum  of  8  images  can  be  accumulated  before  it 
is  necessary  to  either  delete  images  or  build  a  composite. 
This  restriction  is  due  to  the  number  of  permanent  files 
that  can  be  attached. 
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Figure  3.1-2  List  of  Channels 
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Figure  3.1-3  Image  Channel  Correspondence 


4. 


Delete  Image  From  Composite 

After  this  option  is  selected  a  numbered  list  of  the  images 
associated  with  the  primary  composite  are  displayed  on  the 
Tektronix  ,  and  the  user  enters  the  number  of  the  image  (single 
channel  or  multi-channel)  to  be  deleted.  Again,  if  one  is 
deleting  a  composite  image  from  the  primary  composite,  all 
channels  of  that  image  are  deleted. 

5.  Build  Composite  Image 

This  option  is  selected  when  the  user  is  satisfied  with  the 
set  of  images  making  up  the  composite  and  their  sequence.  At 
this  time,  the  actual  building  of  the  label  and  writing  the 
image  to  disk  takes  place.  Because  this  process  can  take  a 
considerable  amount  of  time,  a  status  message  is  displayed 
on  the  Tektronix.  Also,  the  composite  image  is  closed  and  not 
cataloged  which  means  that  the  user  must  save  the  image  via 
DIAL  routines  or  it  will  be  deleted  at  LOGOFF. 

3.1.2  INTERL  Engineering  Description 

INTERL's  main  function  is  the  generation  of  a  composite  image  in  a  band- 
interleaved-by-pixel  format.  Each  record  contains  one  packed  image 
line  of  data.  In  this  format  the  unpacked  line  contains  n  channel 
values  from  pixel  1,  followed  by  n  channel  values  for  pixel  2,  etc., 
where  n  is  the  number  of  channels  in  the  composite  image. 

As  an  example  the  buffer  containing  an  unpacked  record  of  a  four  channel 
by  M  pixel  composite  image  would  be  as  given  below. 


I — ! 


I 

1 

I 

1 


J  ’ 

| . 
[ 


Word 

1 

2 

3 

4 


4*M-3 

4*M-2 

4*M-1 

4*M 


Pixel 

1 

Channel 

Pixel 

1 

Channel 

Pixel 

1 

Channel 

Pixel 

1 

Channel 

Pixel 

M 

Channel 

Pixel 

M 

Channel 

Pixel 

M 

Channel 

Pixel 

M 

Channel 

The  maximum  number  of  channels  for  a  composite  is  24  and  the  actual 
number  of  channels  is  stored  in  word  365  of  the  DIAL  label.  The  label 
contains  standard  DIAL  type  information;  also  there  is  a  photogram- 
metric  parameters  record  that  contains  the  channels  and  images  making 
up  the  composite.  The  number  of  60-bit  words  in  a  record  is 

NWDS  =  (NCHAN*NPIX*NBIT+59) /6 

where: 

NCHAN  =  Number  of  channels 

NPIX  =  Number  of  pixels  per  image  line 

NBIT  =  Number  of  bits/channel 

NWDS  =  Number  of  60-bit  words  in  a  record. 
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Another  important  consideration  is  that  all  the  images  making  up  the 
composite  must  be  registered. 


3.1.3  INTERL  Control  Flow  Processing  Steps 

The  control  flow  of  INTERL  is  determined  by  the  options  selected  by  the 
user.  There  are  two  option  menus,  the  first  of  which  is  only  displayed 
once,  and  the  second  is  displayed  after  each  selected  option  completes 
its  function  or  until  the  user  terminates  the  PM.  The  control  flow/ 
processing  steps  are  given  below  along  with  the  subroutines  used.  DIAL 
subroutines  and  system  routines  are  not  identified. 

Control  Flow  Subroutines  Used 

1.  Initialize  processing. 

2.  Display  first  option  menu 


A. 

B. 

Create  a  new  composite  image 

Use  an  existing  composite  image 

FETEKSC 

Display  second  option  menu 

A.  List  of  composite  image  channels 

LISTCH 

B. 

List  of  channel  image  correspondence 

LISTCH 

C. 

Add  image  to  composite 

ADD IMG 

D. 

Build  composite  image 

PIXINT 

E. 

Terminate  PM 

BLDCOM 

3.1.4 


INTERL  Subroutine  Description 


This  section  describes  all  the  subroutines  identified  in  the  prior 
section  with  the  exception  of  FETEKSC,  which  is  described  in 
Section  3.2.4. 

3. 1.4.1  Subroutine  ADD IMG 

Purpose 

Develops  arrays  containing  descriptions  of  the  images  to  be 
included  in  the  composite. 

Usage 

CALL  ADD IMG  (IREF,  IRENAM,  MAXPFN,  MAXCH,  IERROR) 

Description  of  Parameters 

IREF  -  Input  flag  indicating  if  the  first  image  is  a 
reference  image. 

=0  NO 

=1  YES 

IRENAM  -  Input  flag  indicating  if  renaming  of  image  files 
should  take  place. 

=0  NO 

-1  YES 

MAXPFN  -  Input  variable  describing  the  maximum  number  of  PFN's 
that  can  be  attached  (8) . 

MAXCH  -  Input  variable  describing  the  maximum  number  of 
channels  (24) . 

IERROR  -  Output  error  flag 

“0  No  errors 

*1  Rename  error 

»2  Create  error 

=3  Label  read  error 


3-8 


I 

I 

I 

l 

1 

I 

T 

mm i 


Method 

Based  upon  the  flag  IRENAM  renaming  of  the  composite  already 
in  existence  can  take  place.  The  new  composite  is  given  the 
selected  name  while  the  old  composite  is  named  REFERENCE.  This 
is  needed  when  channels  are  added  to  an  existing  composite. 

A  request  to  the  user  for  the  name  of  the  image  to  be  added 
is  made  and  upon  entry  the  image  is  attached  and  checked  to 
determine  if  it  contains  the  same  number  of  pixels  and  lines 
as  the  base  image.  After  validation,  the  image  characteris¬ 
tics  are  stored  in  arrays  and  the  counters  are  updated. 

Common  Areas  Used 

LINE,  ECS,  PARAMS,  COMIMG,  ASSIMG 
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3.1.4. 2  Subroutine  BLDCOM 

Purpose 

Performs  file  functions  and  builds  label  for  composite  image. 


Usage 

Call  BLDCOM  (IREF,  IERROR) 

Description  of  Parameters 

IREF  -  Input  flag  indicating  if  first  image  is  a  reference 
image 

=0  NO 

=1  YES 

IERROR  -  Output  error  flag 
“0  No  errors 

*1  Cannot  write  composite  image  label 
*21  through  24  Error  in  PIXINT  Routine 
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■3  Cannot  close  composite  image 
‘4  Cannot  read  new  composite  label 


Method 

Builds  and  writes  composite  image  label.  Returns  to  the 
system  all  images  used  in  building  the  composite  and  purges 
the  reference  image  if  there  was  one.  All  the  above  is 
accomplished  with  DIAL  routines. 


COMMON  Areas  Used 

LINE,  ECS,  PARAMS,  COMIMG,  ASSIMG 


.4.3  Subroutine  DELIMG 

Purpose 

Deletes  images  from  list  to  be  used  for  composite. 


Usage 

CALL  DELIMG  (IREF) 

Description  of  Parameters 

IREF  -  Input  flag  indicating  if  the  first  image  is  a 
reference  image. 

-0  NO 

*1  YES 


Method 

Requests  user  to  enter  an  image  name.  If  the  name  is  not 
the  reference  image  name,  the  image  characteristics  are 
removed  from  the  tables,  the  tables  are  compressed  and  the 
image  is  returned  to  the  system. 


COMMON  Areas  Used 


LINE,  ECS,  PARAMS,  COMIMG,  ASSIMG 


3. 1.4. 4  Subroutine  PIXINT 

Purpose 

Generates  a  composite  band-inter leaved-by-pixel  image  file. 


Usage 

Call  PIXINT  ( IERROR) 

Description  of  Parameters 

IERROR  -  Output  error  flag 
*0  No  errors 

=1  Insufficient  central  memory 

=2  Pixel  count  is  not  equal  to  reference  image 

=3  Line  count  is  not  equal  to  reference  image 


Method 

This  routine  is  given  a  set  of  images  which  have  been  located, 
attached,  and  validated.  From  the  image  characteristics  the 
routine  determines  how  to  divide  the  available  central  memory 
and  ECS.  A  result  of  the  above  is  number  of  lines  to  store 
in  ECS  and  this  number  of  lines  is  read  from  the  disk  for 
each  image  and  stored  in  ECS.  A  line  of  the  composite  image 
is  generated  by  unpacking  the  corresponding  lines  from  each 
image  and  storing  the  values  in  appropriate  locations  of  ECS. 
When  the  building  of  the  line  in  ECS  is  completed,  the  line 
is  read  into  central  memory  where  it  is  packed  and  written 
to  disk.  This  is  continued  until  all  lines  have  been,  proc¬ 
essed.  Since  some  of  the  images  can  be  multichannel 
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(composite)  images,  more  than  one  value  from  an  image  can  be 
stored  for  a  given  pixel.  The  output  line  is  in  the  band- 
interleaved-by-pixel  format. 

COMMON  Areas  Used 

LINE,  ECS,  PARMS,  COMIMG,  ASS IMG 

3.2  FIELDEF  PROCESSING  MODULE 

FIELDEF  permits  the  user  to  define  areas  to  be  used  for  training  the 
classifier,  to  be  used  for  classification,  or  to  be  used  for  clustering. 
The  areas  are  termed  "fields."  Fields  can  be  defined  as  a  polygon  with 
up  to  12  sides,  with  all  of  the  pixels  interior  to  the  polygon  con¬ 
sidered  as  the  field,  or  fields  can  be  defined  as  a  series  of  up  to 
12  connected  straight  line  segments,  with  all  of  the  pixels  intersected 
by  the  line  segments  considered  as  the  field. 


3.2.1  User  Information 

It  is  possible  for  the  user  to  operate  FIELDEF  without  entering  a  spe¬ 
cific  name  of  any  of  the  entities  that  must  be  selected  to  provide  in¬ 
formation  to  FIELDEF  (except  for  2  letters  for  display  selection,  and 
the  names  of  the  new  fields).  The  first  menu  (Figure  3.2-1)  that  the 
user  sees  provides  him  with  a  choice  to  define  new  fields  by  displaying 
the  image  and  interacting  with  the  COMTAL  display  and  trackball  to 
specify  field  vertices,  or  to  modify  or  delete  existing  field  defini¬ 
tions.  If  the  user  selects  to  define  new  fields,  the  next  choice  is 
to  specify  the  display (s)  upon  which  the  images  will  be  located 
(Figure  3.2-2).  The  first  letter  specifies  the  display  on  which  the 
full  image,  or  possibly  an  expanded  sub image,  will  be  shown  to  permit 
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Figure  3.2-1.  FIELDEF  Option  Menu 
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the  user  to  locate  the  area  to  be  used  for  field  definition.  The 
second  letter  specifies  the  display  on  which  the  subimage  used  for 
field  definition  will  be  shown  (if  necessary,  both  letters  may  be  the 
same).  The  next  option  menu  (Figure  3.2-3)  permits  the  user  to  select 
a  single  image  data  file  and  display  it  in  gray,  or  select  three  image 
data  files  and  display  the  images  overlayed  in  red,  green,  and  blue. 

The  assumption  is  that  the  three  images  are  different  but  registered 
bands  that  add  information  to  the  red/green/blue  display  for  the  user 
to  choose  fields.  The  next  request  (Figure  3.2-4)  will  be  for  the 
individual  data  set(s)  that  contain  the  image (s)  to  be  displayed.  If 
the  user  does  not  remember  the  name,  a  (CR)  will  provide  a  prompting 
list.  The  next  option  menu  (Figure  3.2-5)  permits  the  user  the  option 
to  select  an  existing  file  to  add  additional  field  definitions,  or  to 
create  a  new  file.  If  the  user  selects  the  existing  file  (Figure  3.2-6) 
he  will  be  prompted  for  the  name  of  the  field/class-file.  As  with  the 
image  file,  if  the  user  does  not  remember  the  name,  a  (CR)  will  pro¬ 
vide  a  prompting  list.  If  the  user  selects  to  create  a  new  file,  he 
will  be  prompted  for  the  file  name,  and  a  file  description.  The  next 
selection  menu  (Figure  3.2-7)  is  the  main  set  of  options  for  defining 
fields.  The  first  option  the  user  would  normally  select  is  #1,  to 
display  the  subimage  used  for  defining  fields.  FIELDEF  will  display 
the  full  image  automatically.  Next,  the  user  has  the  option  to  modify 
the  brightness  and  contrast  by  using  the  trackball,  or  to  leave  the 
image  alone  (Figure  3.2-8).  Subsequent  to  this  a  subarea  must  be 
selected  by  the  user  (Figure  3.2-9).  This  subarea  may  be  expanded  to 
further  examine  the  area,  or  expanded  on  the  second  COMTAL  to  be  used 
for  field  definition  (Figure  3.2-10).  After  a  subimage  has  been  dis¬ 
played  for  field  definition.  Selection  2  (Figure  3.2-7)  will  begin  the 
sequence  of  defining  fields.  The  first  option  menu  (Figure  3.2-11) 
permits  the  user  to  define  areal  or  linear  fields,  or  to  save  a  field 
Just  defined.  Instructions  (Figure  3.2-12)  prompt  the  user  on  how  to 
define  the  field.  After  a  field  has  been  defined,  the  field  name  and 
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Figure  3.2-3  FIELDEF  Image  Type  Menu 
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Figure  3.2-4  FIELDEF  Red  Image  Select  Menu 


Figure  3.2-5  FIELDEF  Field/Class-File  Menu 


Figure  3.2-6  FIELDEF  Field/Class-Data-File  Menu 
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Figure  3.2-7  FIELDEF  Field  Definition  Menu 


Figure  3.2-8  FIELDEF  Image  Function  Tables  Modification  Menu 
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Figure  3.2-9  FIELDEF  Area  Outline  Menu 
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Figure  3.2-12  FIELDEF  Areal  Field  Creation  Menu 


description  will  be  requested  (Figure  3.2-13).  The  user  must  remember 
to  save  the  field  defined  (Figure  3.2-11)  at  the  end  of  definition. 

If  the  user  chooses  option  2  on  the  first  menu  (Figure  3.2-11),  then 
the  next  option  menu  that  appears  is  the  one  shown  in  Figure  3.2-14. 

The  user  then  has  the  choice  to  modify  the  vertices  of  an  existing 
field  record,  to  replace  the  original  record  with  the  modified  record, 
to  delete  selected  records,  or  to  delete  a  field/class-file.  If  the 
user  chooses  option  1,  the  modification  of  the  vertices  of  an  existing 
field,  the  next  menu  displayed  is  shown  in  Figure  3.2-15.  This  is  the 
standard  field  record  selection  menu.  After  a  single  field  is  selected 
and  option  3  chosen,  the  field  modification  display  (Figure  3.2-16)  is 
shown.  One  dimension  of  a  vertex  may  be  modified  at  a  time.  If,  after 
a  modification,  the  field  has  a  segment  that  crosses  another  segment, 
an  error  message  will  appear  at  the  top  of  the  screen.  This  means 
simply  that  the  field  cannot  be  saved  in  the  state  that  it  is  in.  The 
user  may  continue  to  modify  and  complete  the  modification  process.  The 
modified  record  must  be  saved  (replacing  the  original)  as  shown  in 
Figure  3.2-14,  option  2. 

3.2.2  Engineering  Description 

FIELDEF  has  the  capability  to  display  an  image  and  permit  the  user  to 
define  "fields"  that  correspond  to  areas  of  interest.  These  field 
definitions  may  be  utilized  later  as  areas  that  specify  the  pixels  to 
be  used  to  gather  statistics  for  "training"  the  MAXLIK  classifier, 
pixels  to  be  classified,  or  pixels  to  be  clustered  with  the  CLUSTER 
program. 

Fields  may  be  of  two  types,  areal  or  linear.  An  areal  field  consists 
of  a  polygon  with  up  to  a  maximum  of  12  sides.  It  may  be  either  a 
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Figure  3.2-14  FIELDEF  Linear  Field  Creation  Menu 
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Figure  3.2-15  FIELDEF  Field  Modification  Menu 
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Figure  3.2-16  FIELDEF  Vertex  Modification  Menu 


convex  or  a  concave  polygon.  The  pixels  defined  are  those  whose  centers 
are  on  the  polygon  boundary  or  interior  to  the  polygon.  Linear  fields 
consist  of  series  of  one  or  more  connected  straight  line  segments,  up 
to  a  maximum  of  12  segments.  The  pixels  defined  are  those  which  are 
intersected  by  a  field  line  segment. 

Fields  are  stored  simply  as  a  series  of  integer  vertices  with  the 
areal  definitions  always  including  the  closing  segment.  A  standard 
method  is  provided  to  analyze  which  pixels  are  included  in  the  field 
by  using  the  FEUTLLA  subroutine.  FEUTLLA  returns  the  line  segment  or 
segments  included  in  the  field  for  any  particular  line  passed  to  it. 

The  user  specifies  the  field  vertices  by  cursor  position  and  pressing 
the  cursor  button  (normally  "select,"  with  "done"  as  the  last  vertex). 
FIELDEF  automatically  enters  the  last  vertex  and  closes  the  polygon 
when  the  user  presses  "done." 

The  subimage  upon  which  the  user  specifies  the  location  of  the  field  is 
obtained  in  a  two-step  process.  First,  the  entire  image  is  displayed 
on  one  COMTAL.  By  positioning  the  cursor,  the  center  of  a  square  and 
the  size  of  the  square  area  to  be  displayed  is  specified.  This  mag¬ 
nifying  process  can  continue  on  the  original  COMTAL  until  the  area  of 
interest  is  clear.  Then,  a  final  magnification  is  transferred  to  a 
second  COMTAL  to  be  used  for  field  definition. 

Additional  functions  available  in  FIELDEF  are: 

1.  Outline  previously  defined  fields. 

2.  Modify  field  definition  vertex  locations. 
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3.  Delete  field  record(s). 


4.  Delete  field/class  file. 

3.2.3  FIELDEF  PDL 

FIELDEF : BGNSEGMENT  (MAIN) 

TEKDISPLAY  FIELDEF  TITLE  MENU 

CALL  FETEKSO  TO  SELECT  IMAGE  OR  KEYBOARD  OPTION 

CASENTRY  (IMAGE-DEFINE,  KEYBOARD-DEFINE) 

CASE  1  (IMAGE-DEFINE) 

DO  UNTIL  NO  IMAGE  TO  BE  DISPLAYED 

CALL  FETED SD  TO  SELECT  DISPLAYS 

CALL  FETEKSO  TO  SELECT  GRAY  OR  R/G/B  IMAGES 

CASENTRY  (GRAY,  R/G/B) 

CASE  1  (GRAY) 

CALL  FETEKSI  TO  SELECT  GRAY  IMAGE 
CALL  FECOMIV  TO  SET  DISPLAY  VECTORS 
CASE  2  (R/G/B) 

CALL  FETEKSI  3  TIMES  TO  SELECT  R/G/B  IMAGES 
CALL  FECOMIV  TO  SET  DISPLAY  VECTORS 
ENDCASE 

CALL  FETEKFC  TO  SELECT  FIELD /CLASS  FILE 
DO  UNTIL  NO  FURTHER  OPTION 

CALL  FETEKSO  TO  SELECT  1  OF  5  MAIN  OPTIONS 
CASENTRY  (OUTLINE  DEF  IMAGE,  DEFINE  FIELD,  OUTLINE  FIELDS, 
ANOTHER  IMAGE,  END  OF  FIELDEF) 

CASE  1  (OUTLINE  DEFINITION  IMAGE) 

IF  NO  IMAGE  DISPLAYED 

CALL  FECOMDI  TO  DISPLAY  FULL  IMAGE 


CALL  FETEKSO  TO  SELECT  PRESENT  IMAGE  OR  FULL 
IMAGE 

CASENTRY  (PRESENT  IMAGE,  FULL  IMAGE) 

CASE  1  (PRESENT  IMAGE) 

CALL  FECOMDI  OR  FECOMFS  TO  DISPLAY  PRESENT 
IMAGE 

CASE  2  (FULL  IMAGE) 

CALL  FECOMDI  TO  DISPLAY  FULL  IMAGE 
ENDCASE 

ENDIF 

CALL  FETEKSO  TO  SELECT  FUNCTION  TABLE  MODIFY  OR 
SKIP  MODIFY 

IF  FUNCTION  TABLE  MODIFY 

THEN  CALL  GRAYMAP  TO  MODIFY  FUNCTION  TABLE 
ENDIF 

DO  UNTIL  ENOUGH  EXPANSION  ON  COMTAL  1 

SELECT  CENTER  AND  SIZE  OF  SQUARE  TO  BE  EXPANDED 
CALL  FETEKSO  TO  EXPAND  ON  COMTAL  1  OR  END 
CASENTRY  (EXPAND  ON  COMTAL  1,  END) 

CASE  1  (EXPAND  ON  COMTAL  1) 

EXPAND  ON  COMTAL  1 
CASE  2  (END  EXPANSION  ON  COMTAL  1) 

SET  NO  EXPAND  SW. 

ENDCASE 

ENDDO 

EXPAND  ON  COMTAL  2 

CALL  FETEKSO  TO  SELECT  FUNCTION  TABLE  MODIFY  OR 
SKIP  MODIFY 

IF  FUNCTION  TABLE  MODIFY 

THEN  CALL  GRAYMAP  TO  MODIFY  FUNCTION  TABLE  ON 
COMTAL  2 


ENDIF 


CASE  2  (DEFINE  FIELD) 

DO  UNTIL  NO  MORE  FIELDS  TO  BE  DEFINED  OR  CANCELLED 
CALL  FETEKSO  TO  SELECT  OPTION 
CASENTRY  (POLYGON,  LINE,  SAVE,  END) 

CASE  1  (POLYGON  OR  LINE) 

IF  FIRST  POINT 

THEN  ADD  POINT  TO  FIELD  RECORD 
ELSE  CALL  FEFDTAD  TO  CHECK  FOR  ERROR 
IF  OK 

THEN  ADD  POINT 

ELSE  CALL  FETEKEM  TO  DISPLAY  ERROR  MESSAGE 
END  IF 
END  IF 

ENTER  FIELD  NAME 
ENTER  FIELD  DESCRIPTION 
CASE  2  (SAVE) 

IF  FIELD  IS  VALID 

THEN  CALL  FEUTILLA  TO  SAVE  RECORD 

ELSE  CALL  FETEKEM  TO  DISPLAY  ERROR  MESSAGE 

END  IF 

CASE  3  (END) 

SET  NO  FIELDS  SW 
ENDCASE 

ENDDO 

CASE  3  (OUTLINE  FIELDS) 

CALL  FETEKSO  TO  SELECT  IMAGE  TO  OUTLINE  FIELDS 
CALL  FETEKOF  TO  SELECT  FIELDS  TO  DISPLAY 
DISPLAY  ALL  FIELDS  COMPLETELY  CONTAINED  ON  DISPLAY 
CASE  4  (ANOTHER  IMAGE) 

SET  NO  FURTHER  OPTIONS  SW 
CASE  5  (END  OF  FIELD EF) 

SET  NO  FURTHER  OPTIONS  SW 
SET  NO  FURTHER  IMAGE  SW 


ENDCASE 


ENDDO 

ENDDO 

CASE  2  (KEYBOARD  DEFINE) 

DO  UNTIL  NO  FURTHER  OPTION 

CALL  FETEKSO  TO  SELECT  OPTION 

CAS ENTRY  (MODIFY  FIELD, SAVE, DELETEFIELD, DELETE  F/C  FILE, END) 
CASE  1  (MODIFY  FIELD) 

CALL  FETEKFC  TO  SELECT  F/C  FILE 
CALL  FETEKDF  TO  SELECT  FIELD  RECORD 
CALL  FEUTLFC  TO  RETRIEVE  RECORD 
DO  UNTIL  (CR)  ENTERED 
ENTER  FIELD  ITEM 
ENTER  ITEM  VALUE 
IF  NOT  (CR) 

THEN  MODIFY  AND  TEST  FIELD 
END  IF 

ENDDO 

CASE  2  (SAVE)  ^ 

IF  FIELD  IS  OK 

THEN  CALL  FEUTLLA  TO  DELETE  OLD  RECORD 
CALL  FEUTLLA  TO  SAVE  NEW  RECORD 
ELSE  CALL  FETEKEM  TO  DISPLAY  ERROR  MESSAGE 
END  IF 

CASE  3  (DELETE  FIELD) 

CALL  FETEKFC  TO  SELECT  F/C  FILE 
CALL  FETEKDF  TO  SELECT  FIELDS  TO  BE  DELETED 
CALL  FEUTLFC  TO  DELETE  RECORDS 
CASE  4  (DELETE  F/C  FILE) 

CALL  FETEKFC  TO  SELECT  F/C  FILE 
CALL  D PURGE  TO  DELETE  FILE 
CASE  5  (END) 

SET  NO  FURTHER  OPTION  SW 
ENDCASE 


ENDDO 

ENDCASE 

ENDSEGMENT 

3.2.4  FIELDEF  Subroutine  Description 

3. 2.4.1  Subroutine  FECOMDI 

Title  -  Transmits  a  gray  or  red/green/blue  image  to  a  COMTAL  display 
and  displays  the  image. 

Parameters 

CALL  FECOMDI  (FNAMES ,  IMGGRAY,  LBLBUF ,  IDSP,  IXYS) 

FNAMES  -  A  (4,3)  array  that  contains  the  red,  green,  and 
blue  image-data- file  names  in  that  order;  or, 
for  a  gray  image,  the  single  gray  image-data¬ 
file  name. 

IMGGRAY  -  0  -  FNAMES  contains  R/G/B  names 
1  -  FNAMES  contains  a  gray  name 

LBLBUF  -  Label  buffer  supplied  by  the  calling  program, 
at  least  2602  words  long. 

IDSP  -  A  21  element  COMTAL  display  control  vector 
supplied  by  the  calling  program. 

-  A  3-element  vector  returned  to  the  calling 
program  that  includes  the  XY  coordinates  of 
the  registration  point,  and  the  sampling 
factor  (See  subroutine  IMGSIZE) . 


IXYS 


Description  -  This  subroutine  provides  a  standard  means  of  dis¬ 
playing  complete  images  through  the  facilities  of 
subroutine  IMGDSK.  The  calling  program  must  have 
specific  declaration  statements  for  ECS. 

3. 2. 4. 2  Subroutine  FECOMFS 

Title  -  Expands  or  shrinks  a  square  image  or  subimage  to  fit  a  full 
COMTAL  screen  through  linear  interpolation. 

S 

Parameters 

CALL  FECOMFS  ( FNAME , IPC , ILC , NPL , JPC , JLC , JNPL , LBLBUF , BUF , LN , 

LAN,  IANN) 

FNAME  -  The  image-data-f ile  name  that  identifies  the 
source  of  image  data. 

IPC, ILC  -  The  pixel  and  line  location  of  the  center  of 

the  image  or  subimage  to  be  displayed  (requested) . 

NPL  -  The  requested  number  of  pixels  and  lines  to 

be  displayed  on  the  COMTAL  full  screen. 

JPC, JLC,  -  The  actual  pixel  and  line  location  of  the 

JNPL  center,  and  the  number  of  lines  and  pixels 

actually  displayed  on  the  COMTAL  full  screen. 

This  may  differ  from  IPC, ILC,  and  NPL  if  the 
request  implies  an  off-image  extraction. 

LBLBUF  -  Label  buffer  supplied  by  the  calling  program, 
at  least  2602  words  long. 
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BUF  -  Working  expansion  buffer  supplied  by  the 

calling  program,  at  least  12,288  words  long. 

LN  -  Image  plane  that  receives  the  8-bit  image. 

LNA  -  Overlay  plane  that  receives  the  annotation. 

IANN  -  0  -  No  annotation 

1  -  Display  annotation 

Description  -  This  subroutine  provides  all  of  the  function  needed 
to  extract  a  square  subimage  or  image  from  a  disk 
image-data-set  and  expand  or  shrink  that  subimage  or 
image  to  fit  the  full  screen  COMTAL.  Linear  inter¬ 
polation  is  the  resampling  method.  This  subroutine 
does  not  include  invocation  of  subroutine  DISPLAY, 
which  must  be  done  subsequently.  The  calling  program 
must  have  specific  declaration  statements  for  ECS. 

.2.4.3  Subroutine  FECOMIV 

Title  -  Initializes  a  COMTAL  display  control  vector  to  a  standard 
set  of  values. 

Parameters 

CALL  FECOMIV  (COMALPH, IMGGRAY, IMGGLN, IDSP) 

COMALPH  -  The  selected  COMTAL  display  (A,B,C,  or  D) ,  left 
justified  and  zero-filled. 

IMGGRAY  -  0  -  R/G/B  image  to  be  displayed. 

1  -  Gray  image  to  be  displayed. 
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IMGGLN 


The  21  element  COMTAL  display  control  vector 
returned  to  the  calling  program. 


IDSP 


Description 

This  subroutine  provides  a  means  to  initialize  a  COMTAL  dis¬ 
play  control  vector  a  standard  set  of  values.  IMGGLN  is  not 
used  when  IMGGRAY  =  0,  but  a  dummy  entry  must  be  present. 


3. 2.4.4  Subroutine  FETEKCL 


Description 

This  subroutine  provides  a  simple  means  to  clear  the 
Tektronix  display. 


3 . 2 . 4 . 5  Sub rout ine  FETEKDF 


Title 


Displays  the  field  or  class  records  in  a  field/class-file, 
permits  the  user  to  select  one  or  more  records,  and  returns 
a  series  of  pointers  that  point  to  the  location  of  the 
selected  records. 


Parameters 

CALL  FETEKDF  (FNAME,  LBLBFR,IFC,MG1,NC,L0CV,MAXV,NV) 

FNAME  -  Field/Class-f ile  that  contains  the  field  or 
class  records  to  be  selected. 

LBLBFR  -  A  supplied  work  area  for  the  label. 

IFC  -  0  -  Display  field  records  only. 

1  -  Display  class  records  only. 

MG1  -  A  character  string  to  be  added  to  the  first  line 
(SELECT  FIELD (S)  or  SELECT  CLASS (ES))  as  expla¬ 
nation  for  or  about  the  selection  (up  to  50 
characters) . 

NC  -  Number  of  characters  in  MG1. 

LOCV  -  A  single  dimension  array  of  pointers  returned 

that  are  the  indexes  of  the  selected  records  in 
the  file. 

MAXV  -  Maximum  number  of  record  pointers  permitted  to 
be  returned.  If  more  are  selected,  the  amount 
will  be  truncated. 

NV  -  Number  of  record  pointers  selected. 

Description 

This  subroutine  provides  a  standard  means  of  displaying  the 
field  data  or  class  data  in  a  field/class-file,  and  permitting 
the  user  to  select  one  or  more  of  the  fields  or  classes. 


3. 2. 4. 6  Subroutine  FETEKEM 

Title  -  Displays  a  single-line  error  message  on  the  Tektronix 
display  in  a  standard  format. 

Parameters 

CALL  FETEKEM  (MSG, NC , ICLEAR) 

MSG  -  The  error  message  to  be  displayed — may  be  up  to 
81  characters.  Formatting  control  characters 
should  not  be  included. 

NC  -  The  number  of  characters  in  the  error  message. 

ICLEAR  -  0  -  Screen  not  cleared  before  display. 

1  -  Screen  cleared  before  display. 

Description 

This  subroutine  provides  a  standard  method  to  display  single- 
line  error  messages.  The  message  is  displayed  with  a  line 
of  dashes  above  and  below,  and  the  Tektronix  horn  is  beeped. 

3 . 2 . 4 . 7  Subrout ine  FETEKFC 

Title  -  Displays  a  request  for  a  f ield/class-data-f ile  name  on  the 
Tektronix  display,  and  returns  the  name  to  the  calling 
program. 

Parameters 

CALL  FETEKFC  (FNAME,  IOLINE.NREC, ICLEAR) 


3-41 


FNAME  -  Field/class-file  name  returned  by  the  subroutine 
(40  chars.) 

IOLINE  -  0  -  Select  existing  file  only 

1  -  User  has  choice  to  select  existing  file,  or 
create  new  file. 

NREC  -  Number  of  records  in  the  field/class  file 

ICLEAR  -  0  -  Screen  not  cleared  before  display 
1  -  Screen  cleared  before  display 

Description 

This  subroutine  provides  a  display  that  requests  a  field/ 
class-data-f ile  name,  and  uses  LOCATE  to  confirm  and  return 
the  name  to  the  calling  program,  or,  uses  KREATE  to  confirm, 
return  the  name,  and  create  a  new  file. 

3. 2. 4. 8  Subroutine  FETEKSC 

Title  -  Displays  a  request  for  a  composite-data-f ile  name  on  the 
Tektronix  display,  and  returns  the  name  to  the  calling 
program. 

Parameters 

CALL  FETEKSC  (FNAME.NCH.LBLBUF, ICLEAR) 

FNAME  -  Composite-file  name  returned  by  the  subroutine 
(40  chars.) 

NCH  -  Number  of  channels  included  in  the  composite 
file,  returned  by  the  subroutine. 
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LBLBUF  -  Label  buffer  supplied  by  the  calling  program, 
at  least  2602  words  long. 

ICLEAR  -  0  -  Screen  not  cleared  before  display. 

1  -  Screen  cleared  before  display.  1 

Description 

This  subroutine  provides  a  display  that  requests  a  composite- 
data-file  name,  and  uses  LOCATE  to  confirm  and  return  the  name 
to  the  calling  program. 


3. 2. 4. 9  Subroutine  FETEKSD 

Title  -  Displays  a  request  for  image  display  selection  on  the 

Tektronix  screen  and  reads  a  pair  of  letters  corresponding 
to  the  users  request.  The  letters  are  returned  to  the 
calling  program. 

Parameters 

CALL  FETEKSD  (DIS1.DIS2, ICLEAR) 

DIS1  -  The  first  letter  (A,B,C  or  D)  is  returned,  left 
justified  and  zero-filled. 

DIS2  -  The  second  letter  is  returned,  left  justified 
and  zero-filled. 


ICLEAR  -  0  -  Screen  not  cleared  before  display 
1  -  Screen  cleared  before  display 


Description 

This  subroutine  provides  a  header  'SELECT  DISPLAY (S) ' ,  fol¬ 
lowed  by  an  explanation  of  the  entry  required  by  the  user. 

The  entered  letters  are  returned  via  the  above  parameters. 
This  subroutine  is  designed  to  be  used  with  subroutine 
FECOMIV  to  build  a  control  vector  for  the  COMTAL  display. 

3.2.4.10  Subroutine  FETEKSI 

Title  -  Displays  a  request  for  an  image-data-f ile  name  on  the 

Tektronix  display,  and  uses  LOCATE  to  return  the  file  name 

Parameters 

CALL  FETEKSI  (FNAME,  ICOLOR, ICLEAR) 

FNAME  -  The  image-data-f ile  name  returned  by  the 
subroutine. 

ICOLOR  -  0  -  "GRAY  IMAGE" 

1  -  "RED  IMAGE" 

2  -  "GREEN  IMAGE" 

3  -  "BLUE  IMAGE" 

ICLEAR  -  0  -  Screen  not  cleared  before  display. 

1  -  Screen  cleared  before  display. 

Description 

This  subroutine  provides  a  display  that  requests  an  image- 
data-file  name,  and  uses  LOCATE  to  confirm  and  return  the 
name  to  the  calling  program.  The  parameter  ICOLOR  changes 
the  display  wording  only. 


3.2.4.11  Subroutine  FETEKSO 


Title  -  Displays  a  list  of  options  on  the  Tektronix  screen  and 
reads  a  number  corresponding  to  the  option  list. 

Parameters 

CALL  FETEKSO  (MSG , IVAL , NW , NL , ICLEAR , IRESPN) 

MSG  -  A  dimensioned  array  of  type  integer  or  real  which 
contains  the  character  data  to  be  displayed  as 
the  option  list.  The  character  data  must  con¬ 
sist  of  display  code  characters,  with  each  line 
or  option  choice  starting  a  new  line  of  the 
array.  Formatting  control  characters  should 
not  be  included. 

IVAL  -  A  single  dimensioned  array  of  integer  values  that 
correspond  to  the  count  of  the  number  of  charac¬ 
ters  per  option  description  line. 

NW  -  The  number  of  words  per  row  in  MSG. 

NL  -  An  integer  value  of  the  number  of  option  des¬ 

cription  lines  in  MSG. 

ICLEAR-  0  -  Screen  not  cleared  before  display. 

1  -  Screen  cleared  before  display. 

IRSPN  -  The  value  entered  by  the  user  to  indicate  the 
option  selected,  and  returned  to  the  calling 
program. 


Description 

This  subroutine  provides  a  header  'SELECT  OPTION',  followed 
by  the  character  data  in  MSG.  The  user  must  enter  and  integer 
that  ranges  from  1  to  the  number  of  option  choices.  The  re¬ 
turned  value  typically  could  be  used  directly  in  a  computed 
GO  TO  implemented  CASENTRY  control  figure. 

Example : 

DIMENSION  MENU1(5, 2) , IMCL1(2) 

DATA  IMLCl/32,45/ 

DATA  MENU1(1,1) ,MENU1(2,1) ,MENU1(3,1) ,MENU1(4,1) 

$  /  10H1. DEFINE  0,10HFIELD(S)M,10HUSING  IMAG, 

$  2HES/ 

DATA  MENU1(1,2) ,MENU1(2,2) ,MENU1(3,2) ,MENU1(4.2) 

$  /10H2 .0DEFINE/, 10HMODIFY  FIG,10HLD(S)  WITH, 

$  10H  KEYBOARDS;  5H INPUT/ 

CALL  FETEKSO  (MENU1 , IMLC1 ,5,2,1, ICASE) 

CASENTRY  ( IMAGE-DEFINE , KEYBOARD-DEFINE) 

GO  TO  (10,20)  ICASE 

3.2.4.12  Subrout ine  FEUTLFC 

Title  -  Retrieves,  adds,  or  deletes  specific  records  in  a  field/ 
class-file. 

Parameters 

CALL  FEUTLFC  (FNAME , LBLBFR , IFUNCT , ILOC , FCBFR) 

FNAME  -  The  field/class-file  from/to  which  data  is  to 
be  retrieved,  added,  or  deleted. 

LBLBFR  -  A  supplied  work  area  for  the  label. 
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IFUNCT  -  0  -  Retrieve  record 

1  -  Add  record 

2  -  Delete  record 

ILOC  -  For  IFUNCT  =  0  -  Points  to  the  record  to  be 

retrieved. 

For  IFUNCT  =  2  -  Points  to  the  record  to  be 
deleted . 


FCBFR  -  For  IFUNCT  *  0  -  Retrieved  record  is  placed  in 

this  area. 

For  IFUNCT  =  1  -  Record  to  be  added  is  located 
in  this  area. 

Description 

This  s  ^routine  provides  a  standard  method  to  retrieve,  add, 
or  delete  records  in  a  field/class-file.  ILOC  must  point  to 
the  actual  record  in  the  file,  not  to  the  relative  field 
record,  or  relative  class  record.  This  subroutine  along  with 
FETEKDF  provides  a  complete  capability  to  operate  on  the 
field/class  files. 


3.2.4.13  Subroutine  FEUTLLA 

Title  -  Determines  the  maximum  and  minimum  lines  involved  in  a 
particular  field  definition,  and,  for  a  particular  line 
determines  the  segment (s)  of  the  line  defined  by  the  field. 

Parameters 

CALL  FEUTLLA  (IFCR,IRPIX,L1,L2,LINE,NSEG,NSTRT,NEND) 
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IFCR  -  Input  Field/Class  record 


IRPIX  -  0  -  Determine  min  and  max  line  (LI  &  L2)  only. 

1  -  Determine  segment(s)  of  line  defined  by  the 
Field  IFCR. 


LI  -  Minimum  line  value  defined  by  the  field. 

L2  -  Maximum  line  value  defined  by  the  field. 

LINE  -  Particular  LINE  for  which  segment (s)  defined  by 
the  field  are  to  be  determined. 

NSEG  -  Number  of  segments  defined  by  the  field  for 
this  LINE  (MAX.  12). 


NSTRT  -  A  one-dimensional  array  of  NSEG  starting  segment 
values . 


NEND  -  A  one-dimensional  array  of  NSEG  corresponding 
ending  segment  values. 


Description 

This  subroutine  provides  a  method  to  determine  the  pixel  seg¬ 
ment  (s)  included  in  a  field  (areal  or  linear)  for  any  line 
included  in  the  field. 


3.2.4.14  Subroutine  SYMIN 


Title  -  Inverts  a  symmetric  matrix,  and  calculates  the  determinant 
of  the  matrix. 
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CALL  SYMIN  (A,AI,DET,N) 


Row/column  dimensions  of  A 


Description 

This  subroutine  inverts  a  symmetric  matrix  and  provides  the 
determinant  of  that  matrix.  Presumed  dimensions  are  24  x  24 


3.3  CLASTAT 


CLASTAT  permits  the  user  to  utilize  the  fields  defined  by  FIELDEF  to 
specify  the  location  of  pixels  for  the  gathering  of  signature  statistics 
One  or  more  fields  may  be  specified  for  a  single  signature,  with  all 
of  the  corresponding  pixels  utilized  for  the  statistics.  A  combining 
function  is  provided  to  permit  two  or  more  classes  to  be  combined  into 
a  single  new  class.  And  a  distance  metric  is  provided  in  the  form  of 
the  Bhattacharyya  distance  calculation.  This  provides  an  indication  of 
the  similarity  of  pairs  of  classes. 


The  user  may  interact  with  CLASTAT  primarily  through  simple  option 
selection,  except  for  naming  new  classes  that  are  created. 


The  first  information  the  user  must  enter  is  the  field/class-file  name, 
or  select  one  of  the  available  files  (Figure  3.3.1).  The  next  item 
shown  on  the  Tektronix  is  the  main  option  menu  for  CLASTAT  (Figure  3.3-2). 
This  gives  the  user  the  choice  to  calculate  class  signature  statistics 
from  one  or  more  fields,  to  combine  statistics  from  one  or  more  classes 
into  a  new  class,  to  delete  class  records,  to  calculate  the 
Bhattacharyya  distance  between  two  classes,  and  to  print  class  informa¬ 
tion  on  the  line  printer  from  a  set  of  selected  classes.  If  the  user 
selects  option  1,  he  will  be  asked  to  supply  the  composite-file  name, 
and  the  field  or  fields  to  be  referenced  for  pixel  retrieval  (Figure 
3.3-3).  Subsequent  to  the  calculation  the  user  must  enter  the  new 
class  name  and  description.  If  the  user  selects  option  2  to  combine 
classes,  he  must  select  the  classes  and  then  supply  the  new  name  and 
description.  In  either  case  he  will  be  asked  explicitly  to  save  the 
new  class.  To  delete  classes,  the  user  simply  selects  the  classes 
from  the  class  selection  menu.  To  calculate  the  Bhattacharyya  distance 
between  two  classes  (option  4) ,  the  user  selects  two  classes  from  the 
class  selection  menu  (Figures  3.3-4,  3.3-5),  and  the  result  is  displayed 
as  i,i  Figure  3.3-6.  To  print  class  data  (option  5),  the  user  selects 
the  classes  from  the  class  selection  menu  and  the  results  are  printed 
on  the  line  printer. 

3.3.2  Engineering  Description 

The  primary  function  of  CLASTAT  is  to  gather  mean  and  covariance 
statistics,  by  channel,  from  pixels  specified  by  fields.  More  than 
one  field  may  be  specified,  and  in  this  event  the  statistics  will  be 
gathered  over  all  of  the  pixels  in  all  of  the  fields.  The  resultant 
covariance  matrix  is  stored  in  the  upper  half  of  the  class  record  co- 
variance  array,  while  the  correlation  matrix  is  stored  in  the  lower 
half  of  the  class  record  covariance  array. 


Figure  3.3-1  CLASTAT  Field/Class-Data-File  Select  Menu 
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Figure  3.3-2  CLASTAT  Principle  Options  Menu 


»«- 

111  0> 

Z  CH 
O  3J 
h  h 
I-  CO 
Q_  Zb I 
O  Oh 
HO 

i-  <aui 
o  «j 
u  mui 
_i  mmo 
in  «  in 
m  -jui  i- 
oxui 


U.OO 

CO 


m  o 

OWh 

JwO 

UIOUI 

H  JJ 
ll.  Ill  111 

H«n 
»-  u. 
o  o 

U1Q.~J 

JOUI 

UCH 

maiiL 


Hiun 


r> 

o 


3 

_i 

« 


K 

W 


3-53 


Figure  3.3-3  CLASTA1  ield  Selection  Menu 


Figure  3.3  4  CLASTAT  Class  Selection  Menu 


Figure  3.3-6  Bhattacharyya  Distance  Between  Fieldl  and  Field5 


The  resultant  mean  and  covariance  data  is  utilized  subsequently  by 
MAXLIK  to  "train"  the  maximum  likelihood  classifier,  and  also  the  data 
may  be  used  by  CLUSTER  to  provide  the  mean  values  of  a  set  of  starting 
clusters.  CLASTAT  also  provides  the  capability  to  combine  the  class 
statistics  of  two  or  more  classes  into  a  single  new  class. 

Means  are  calculated  by: 


■c-fS 


where  and  x^  are  n  dimensional  vectors,  and 


y  is  the  mean  for  class  c, 
c 


x^  is  the  individual  pixel  vector, 


N£  is  the  number  of  pixels  in  class  c. 


Covariance  matrices  are  calculated  by: 


Ec  (xk-  yc)  (\-  ycr 


To  combine  2  classes,  the  means  are  calculated  as  follows: 

N  '  y  '  +  N  "  p  " 
c  c _ c  c 

yc  •  N  '  +  N  " 

c  c 


where  U  is  the  resultant  mean  vector, 
c  » 
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N  '  is  the  number  of  pixels  in  the  first  class. 


N^"  is  the  number  of  pixels  in  the  second  class. 


U  '  is  the  first  mean  vector, 
c 


U  "  is  the  second  mean  vector, 
c 


Covariance  matrices  are  combined  as  follows: 


ZN  '  -1  ^  N  * 

c  "  N  ^  c’  +  N  -1 


(u  ’  -  u  )  (u  ’  -  P  + 

c  c  c  c 


N  ”  -1 
c 

fl — =1 

c 


N  M 

S  "  +TT-T  (y  ''  -  vj(v  "  -  vj 

c  N  -l  c  c  c  c 


where  N  *  N  '  +  N  " 
c  c  c 


U  is  the  combined  mean  vector, 
c 


A  measure  of  the  similarity  of  two  classes  is  provided  through  the 
Bhattacharyya  distance  calculation.  The  more  similar  the  two  classes 
are  to  each  other,  the  closer  the  distance  measure  is  to  1,  the  more 
dissimilar,  the  closer  the  measure  is  to  0. 


The  calculation  of  Bhattacharyya  distance  (Ref.  No.  2)  is  as  follows: 


sj 1/4 1 1 r 


-1/8  {T 


3-58 


P  is  the  Bhattacharyya  distance 


Besides  providing  the  capability  to  gather  signature  class  statistics 
combine  class  statistics,  and  calculate  the  Bhattacharyya  distance, 
CLASTAT  also  provides  the  capability  to: 


1.  Delete  class  record (s) 


Print  class  data  (name,  size,  means,  covariance  matrix,  cor¬ 
relation  matrix,  and  source  field  names)  of  selected  classes 
on  the  line  printer. 


3.3.3  CLASTAT  PDL 
CLASTAT : BGNSEGMENT (MAIN) 

CALL  TITLE  TO  DISPLAY  CLASTAT  TITLE  MENU 


*k 


CALL  FETEKFC  TO  SELECT  FIELD/CLASS  FILE 
DO  UNTIL  NO  FURTHER  OPTION  TO  BE  SELECTED 

CALL  FETEKSO  TO  SELECT  1  OF  6  MAIN  OPTIONS 
CAS  ENTRY  (FIELD  STATS,  COMBINE,  DELETE  CLASS, 

BHA TTACHARYYA  DISTANCE,  PRINT  FILE,  END) 

CASE  1  (FIELD  STATS) 

CALL  FETEKSC  TO  SELECT  COMPOSITE  FILE 

CALL  FETEKDF  TO  SELECT  FIELD(S)  FOR  CLASS  STATISTICS 

IF  ONE  OR  MORE  FIELDS  CHOSEN 

THEN 

DO  FROM  1  TO  NUMBER  OF  FIELDS 

CALL  FEUTLFC  TO  RETRIEVE  FIELD 

CALL  FEUTLLA  TO  DETERMINE  MIN  AND  MAX  LINES  IN  FIELD 

COPY  FIELD  NAME  INTO  CLASS  RECORD 

DO  FROM  FIRST  FIELD  LINE  TO  LAST  FIELD  LINE 

CALL  FEUTLLA  TO  OBTAIN  LINE  SEGMENTS  IN  FIELD 
DO  FROM  1  TO  NUMBER  OF  SEGMENTS 

CALL  FECLSUM  TO  ADD  INTERMEDIATE  SUMS 
ENDDO 

ENDDO 

ENDDO 

CALCULATE  MEANS  FOR  EACH  CHANNEL 
CALCULATE  COVARIANCE  MATRIX  (UPPER  HALF) 

CALCULATE  CORRELATION  MATRIX  (LOWER  HALF) 

ENTER  CLASS  NAME 
ENTER  CLASS  DESCRIPTION 

•  END  IF 

CASE  2  (COMBINE) 

CALL  FETEKDF  TO  SELECT  CLASSES  TO  COMBINE  INTO  ONE 

IF  ONE  OR  MORE  CLASSES  CHOSEN 

THEN  CALL  FEUTLFC  TO  RETRIEVE  FIRST  CLASS  RECORD 

DO  WHILE  MORE  RECORDS  TO  COMBINE  AND  NUM  OF  CHANNELS 
ARE  EQUAL 

CALL  FEUTLFC  TO  RETRIEVE  NEXT  CLASS  RECORD 
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IF  NUMBER  OF  CHANNELS  EQUAL 
THEN 

COMBINE  MEANS 
COMBINE  COVARIANCE  MATRIX 
ADD  NUMBER  OF  PIXELS 
COPY  FIELD  NAME(S)  HISTORY 

ELSE 

SET  NUM  OF  CHAN  EQ  SW  TO  NO 
CALL  FETEKEM  TO  DISPLAY  ERROR  MESSAGE 
END  IF 

ENDDO 

IF  NUMBER  OF  CHANNELS  EQUAL 
THEN 

CALCULATE  RESULTANT  CORRELATION  MATRIX (LOWER  HALF) 

ENTER  NEW  CLASS  NAME 

ENTER  NEW  CLASS  DESCRIPTION 

END  IF 

ENDIF 

CASE  3  (DELETE  CLASS) 

CALL  FETEKFC  TO  SELECT  CLASS (ES)  TO  DELETE 
CALL  FEUTLFC  TO  DELETE 
CASE  4  ( BHATTACHARYYA  DISTANCE) 

CALL  FETEKDF  TO  SELECT  2  CLASSES 
IF  NUMBER  OF  CLASSES  EQUAL  2 
THEN 

CALL  FEUTLFC  TO  RETRIEVE  1ST  CLASS 

CALL  FEUTLFC  TO  RETRIEVE  2ND  CLASS 

IF  NUMBER  OF  CHANNELS  EQUAL 

THEN  CALCULATE  BHATTACHARYYA  DISTANCE 

ELSE  CALL  FETEKEM  TO  DISPLAY  ERROR  MESSAGE 

ENDTF 


ENDIF 


CASE  5  (PRINT  FILE) 

CALL  FETEKFC  TO  SELECT  CLASSES  TO  PRINT 


IF  ONE  OR  MORE  CLASSES  CHOSEN 


DO  FROM  1  TO  NUMBER  OF  CLASSES 

CALL  FEUTLFC  TO  RETRIEVE  CLASS  RECORD 

PRINT  NAME, MEANS, COV /COR  MATRIX, AND  FIELD  HISTORY 


ENDDO 


CALL  Q9DISPO  TO  PASS  PRINT  FILE  TO  SYSTEM 


ENDIF 


CASE  6  (END) 

SET  NO  FURTHER  OPTIONS  SW 


ENDCASE 


ENDDO 


ENDSEGMENT 


Subroutine  Description 


3. 3. 4.1  Subroutine  FETEKCL 


For  a  description,  see  section  3. 2. 4. 4 


3. 3.4. 2  Subroutine  FETEKDF 


For  a  description,  see  section  3. 2. 4. 5 


3. 3.4. 3  Subroutine  FETEKEM 


3 . 3 . 4 . 4  Subroutine  FETEKFC 

For  a  description,  see  section  3. 2.4. 7. 

3. 3. 4. 5  Subroutine  FETEKSC 

For  a  description,  see  section  3. 2.4.8. 

3. 3. 4. 6  Subroutine  FETEKSO 

For  a  description,  see  section  3.2.4.11. 

3 . 3 . 4 . 7  Subrout ine  FEUTLFC 

For  a  description,  see  section  3.2.4.12. 

3 . 3 . 4 . 8  Subrout ine  FEUTLLA 

For  a  description,  see  section  3.2.4.13. 

3 . 3 . 4 . 9  Subrout ine  SYMIN 

For  a  description,  see  section  3.2.4.14. 
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3.4  CLUSTER  PROCESSING  MODULE 


CLUSTER  embodies  an  iterative  unsupervised  classification  (clustering) 
process  based  on  Ball  and  Hall's  ISODATA  (Iterative  Self-Chrganizing 
Data  Analysis  Techniques  A)  algorithm  (Ref.  1).  The  algorithm  makes 
multiple  passes  through  the  data,  assigning  data  to  clusters,  and 
splitting  or  combining  the  clusters  so  formed  in  a  sequence  determined 
by  the  user. 


3.4.1  User  Information 

The  user  is  first  requested  to  specify  a  composite  image  containing  at 
least  two  channels,  and  then  to  select  fields  within  this  image.  CLUSTER 
forms  the  rectangular  hull  of  the  set  of  fields  and  it  is  the  data  in 
this  area  which  is  grouped  into  clusters.  If  the  user  has  specified  one 
or  more  COMTAL  displays  in  the  LOGON  procedure,  he  may  then  elect  to 
display  the  image  to  be  clustered.  In  any  event,  the  output  (clustered) 
image  to  be  created  must  be  named.  The  name  can  also  be  annotated  or 
described  with  up  to  40  characters. 

The  user  is  then  presented  with  an  option  menu  from  which  the  subsequent 
computational  path  is  selected.  The  PM  automatically  returns  to  this 
menu  at  the  appropriate  points  in  the  computation.  In  general,  the 
"Cluster"  option  cannot  be  exercised  until  starting  vectors  have  been 
specified,  and  the  "Cluster  Map"  option  cannot  be  exercised  until 
clustering  has  taken  place. 

The  "Modify  Parameters"  option  enables  the  user  to  review  the  values 
of  the  eleven  parameters  which  are  used  at  various  points  in  the  compu¬ 
tation  and  to  specify  the  sequence  of  cluster  splitting  and  cluster 
combining  iterations.  The  eleven  parameters  are  initialized  to  default 


values  when  the  PM  is  called,  and  the  split/combine  sequence  is  empty, 
that  is  if  this  sequence  is  not  changed  by  the  user,  data  would  be 
assigned  to  clusters  and  the  clustering  process  would  terminate  pending 
further  instructions  from  the  user.  Each  of  the  parameters  has  a 
maximum  allowable  value  and  if  the  user  attempts  to  enter  a  value 
greater  than  the  maximum,  the  message  requesting  the  parameter  value 
is  repeated.  Clustering  is  by  its  nature  an  interactive  process  and  a 
certain  amount  of  experimentation  with  the  parameters  is  necessary  to 
obtain  optimal  results. 

The  "Starting  Vectors"  option  enables  the  user  to  specify  cluster 
centers  for  the  initial  assignment  of  data  to  clusters.  There  are 
three  possibilities:  "Zero"  or  self-generated,  which  selects  a  single 
starting  vector  consisting  of  zeros  in  all  channels;  "New,"  which 
allows  the  user  to  select  up  to  50  previously  defined  classes  as 
starting  vectors;  and  "Old"  which  defines  the  starting  vectors  as  the 
clusters  determined  during  the  immediately  preceding  cluster  run  of 
the  present  DIAL  session.  The  "Old"  option  cannot  be  exercised  until 
the  "Cluster"  option  has  been  exercised  at  least  once. 

The  "Cluster"  option  instructs  the  PM  to  perform  the  clustering  compu¬ 
tation.  In  the  first  pass  through  the  data  points  are  assigned  to  the 
nearest  cluster  according  to  one  of  two  distance  measures;  unweighted 
Euclidean  if  IDISF  =  1,  and  (sum  of  absolute  values)  if  IDISF  =  2. 
The  subsequent  computation  depends  on  the  split /combine  sequence  and 
the  value  of  P.  If  P  >  0,  the  program  will  perform  the  iteration  cycle 
of  splitting  clusters  and  reassigning  the  data  points  to  clusters  until 
P  x  100  percent  of  the  clusters  are  stable,  or  PN  iterations  have  taken 
place,  at  which  time  control  is  passed  to  the  split/combine  sequence. 

If  P  *  0,  the  initial  cluster  splitting  is  bypassed  and  control  is 
passed  to  the  split/combine  sequence  after  the  initial  assignment  of 
pixels  to  clusters.  In  either  case,  if  the  split/combine  sequence  is 
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empty,  no  further  clustering  computation  takes  place.  After  each  itera¬ 
tion,  the  resulting  cluster  centers  and  channel  variances  are  available 
for  display.  After  the  last  iteration  of  a  given  sequence,  cluster  popu 
lations  before  and  after  thresholding  and  the  distances  between  cluster 
centers  and  cluster  means  are  displayed.  At  several  points  in  the  com¬ 
putation,  the  process  pauses  to  allow  the  user  to  make  a  hard  copy  of 
the  Tektronix  display.  Computation  resumes  when  the  user  enters  (CR) . 

Once  an  output  (clustered)  image  has  been  created  in  the  "Cluster" 
option,  the  "Results/Cluster  Map"  option  enables  the  user  to  display  a 
pseudo-color  cluster  map  on  one  of  the  COMTAL  displays,  to  display  a 
portion  of  the  clustered  image  on  the  Tektronix  display  in  which  each 
cluster  is  represented  by  a  character  chosen  by  the  user,  and  to  list 
thi  numerical  clustering  results  (percent  of  the  data  points  in  each 
cluster,  etc.). 

Finally,  the  "Exit"  option  terminates  the  PM  .  If  the  Cluster  PM  is 
called  again  during  the  same  terminal  session  (that  is,  without  an  inter 
vening  LOGOFF) ,  the  clustering  parameters  are  set  to  their  default 
values  and  no  cluster  means  are  available  for  the  "Old"  suboption  of 
the  "SeleC' Starting  Vectors"  option,  but  the  names  of  any  output  images 
are  saved,  so  that  new  names  must  be  chosen  for  subsequent  output  images 

The  Tektronix  displays  created  in  a  representative  session  are  shown  in 
Figures  3.4-1  through  3.4-13.  Figure  3.4-1  shows  the  first  two  menus 
the  user  encounters.  Composite-scene-file  ERIMSYN  and  associated 
field-class  file  FCERIM  were  selected.  Figure  3.4-2  is  the  display 
after  fields  1,  2  and  3  were  selected  and  the  "Drop  Fields"  option 
exercised.  The  asterisks  indicate  that  the  corresponding  fields  have 
been  selected;  in  this  run  no  fields  were  dropped  subsequent  to 
selection.  Figure  3.4-3  displays  the  values  of  the  clustering  param¬ 
eters  after  Tl,  T2  and  R2  were  changed  from  their  default  values  to 
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Figure  3.4-1  CLUSTER  Composite  Scene  and  Field/Class-Data-File  Selection  Menu 
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Figure  3.4-2  Display  of  Selected  Fields  Data 
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Figure  3.4-3  CLUSTER  Initial  Parameters  Menu 


the  values  Indicated,  and  a  split /combine  sequence  of  SSSCS  was 
specified.  The  "Zero"  starting  vector  option  was  chosen,  as  Figure 
3.4-4  indicates,  and  the  clustering  computation  proper  initiated. 

Figures  3.4-5  through  3.4-10  show  the  evolution  of  the  clusters  through 
the  course  of  the  specified  split/combine  sequence.  Figure  3.4-11  gives 
a  summary  of  the  cluster  populations  before  and  after  clusters  having 
fewer  than  the  minimum  number  of  pixels  are  eliminated  (in  the  present 
case  none  was  eliminated).  Finally,  Figures  3.4-12  and  -13  illus¬ 
trate  summary  reports  available  in  the  "Reports/Cluster  Map"  option. 

(Of  course,  the  actual  cluster  map  which  is  displayed  on  one  of  the 
COMTALs  is  not  available.)  Not  included  in  the  group  of  figures  are 
displays  which  report  the  percent  of  the  image  classified  at  10%  in¬ 
tervals,  which  keeps  the  user  informed  about  the  progress  of  the  com¬ 
putation;  displays  which  give  the  cluster  number  and  channel  of 
clusters  which  have  been  split,  and  the  cluster  number  of  clusters 
which  have  been  merged;  and  a  summary  of  the  distances  between 
cluster  means  and  centers  and  minimum  distance  statistics. 


3.4.2  Engineering  Description 

As  mentioned  in  Section  3.4.1,  the  CLUSTER  PM  embodies  a  version  of 
Ball  and  Hall's  ISODATA  algorithm,  a  method  of  unsupervised  pattern 
classification.  The  method  is  iterative,  making  a  series  of  passes 
through  the  data  to  assign  the  data  points  (pixels)  to  clusters  on 
the  basis  of  one  of  two  possible  distance  measures,  and  a  sequence  of 
splitting  or  combining  of  clusters  so  formed,  the  cycle  of  computations 
being  dependent  on  parameters  chosen  by  the  user.  The  region  to  be 
clustered  in  the  present  version  of  CLUSTER  is  the  rectangular  hull  of 
the  fields  within  the  composite  image  designated  by  the  user.  The 
composite  image  must  contain  at  least  two  channels,  that  is  each  data 
point  considered  as  a  vector  must  have  at  least  two  dimensions.  All 


Figure  3.4-4  CLUSTER  Starting  Vectors  Display 
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Figure  3.4-6  CLUSTER  Interim  Statistics  Display,  Second  Pass 
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Figure  3.4-7  CLUSTER  Interim  Statistics,  Third  Pass 


Figure  3.4-8a  CLUSTER  Interim  Statistics,  Fourth  Pass 
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Figure  3.4-9  CLUSTER  Interim  Statistics,  Fifth  Pass 


Figure  3.4-10a  CLUSTER  Interim  Statistics,  Sixth  Pass 
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Figure  3.4-10b  CLUSTER  Interim  Statistics,  Sixth  Pass  (continued) 


Figure  3.4-11  CLUSTER  Population  Statistics  Before  and  Eliminating  Small  Clusters 
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Figure  3.4-12a  CLUSTER  Summary  Statistics 
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Figure  3.4-12b  CLUSTER  Summary  Statistics  (continued) 


Figure  3.4-13  CLUSTER  Field  Statistics 


such  vectors  must  have  the  same  number  of  components,  and  corresponding 
components  of  all  of  the  vectors  must  be  of  the  same  type  (e.g.  sensor 
intensity  values  in  a  given  band),  but  different  components  of  a  given 
vector  may  be  of  different  types  (e.g.  components  one  through  four 
might  be  the  respective  MSS  intensities  of  a  pixel  (data  point)  in 
LANDSAT  bands  4  through  7,  and  component  five  might  be  the  elevation  of 
the  pixel  in  meters) . 

The  course  of  a  clustering  computation  is  governed  by  twelve  parameters 
which  are  initialized  to  default  values,  but  whose  values  may  be  changed 
by  the  user.  These  parameters  are: 

1.  T1  -  Threshold  for  cluster  splitting,  in  the  same  units  as  the 
respective  channel  standard  deviations.  If  for  a  given  cluster 
the  standard  deviation  of  a  given  channel  is  greater  than  Tl, 
then  that  cluster  is  a  candidate  for  splitting  (in  the  split 
computation)  along  the  channel  in  question,  if  certain  other 
conditions  are  met  (see  below) . 

2.  T2  -  Threshold  for  cluster  combining  in  the  same  units  as  the 
distance  measure.  If  the  distance  between  clusters  is  less 
than  T2,  the  two  clusters  are  candidates  for  merging  in  the 
combine  computation. 

3.  NMIN  -  Minimum  number  of  data  points  in  a  cluster.  After  the 
last  requested  iteration,  clusters  with  fewer  than  NMIN  points 
are  eliminated  and  the  points  in  those  clusters  are  assigned 
to  the  remaining  clusters  if  the  respective  distances  are 
less  than  R2,  or  to  the  null  cluster  otherwise. 

4.  NVMAX  -  The  maximum  number  of  clusters  allowed. 
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5.  SEP  -  The  scale  factor  which  multiplies  the  ith  channel 
standard  deviation  in  cluster  splitting.  The  resulting  clus¬ 
ters  have  respective  means  of  y.^  ±  SEP*oi,  where  y^  is  the 
mean  of  the  ith  channel  before  splitting,  and  is  the 
standard  deviation  of  the  ith  channel  before  splitting. 

6.  ISODAT  -  Selects  the  distance  measure  used  in  the  cluster  com¬ 
bining  algorithm.  If  ISODAT  ■  0,  the  unweighted  euclidean 
distance  is  used  to  define  the  distance  between  clusters 

i  and  j.  That  is,  if  y^  is  the  component  of  the  k-th  channel 
of  the  ith  cluster  and  each  cluster  has  n  channels. 


(yik  ‘  yjkV 


If  ISODAT  =  1,  the  weighted  euclidean  distance  is  used.  That 
is,  if  is  the  standard  deviation  of  the  kth  channel  of 
the  ith  cluster, 


yik  *  P1k 


aik  *  ajk 


1DISF  -  Selects  the  distance  measure  used  in  assigning  data 
points  to  clusters.  If  IDISF  »  1,  the  unweighted  euclidean 
distance  is  used.  That  is,  if  x^  is  the  component  of  the 
kth  channel  of  the  ith  data  point  and  m^  is  the  kth  component 
of  the  vector  of  the  jth  cluster  centers,  the  distance  D^  be¬ 
tween  the  ith  point  and  the  jth  cluster  center  is 
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If  IDISF  ■  2,  the  so-called  distance  is  used,  that  is 

n 

Pij  *  ^  ;  I  (xik  "  mjk^  I 

k-1 

8.  P  -  The  proportion  (0.  <  P  <  1.0)  of  clusters  which  must  be 
stable  (i.e.  do  not  split)  in  the  "Initial  Split  Iteration" 
option  in  order  for  the  split  computation  to  terminate  before 
PMAX  iterations  (see  10  below).  If  P  =  0.,  control  passes  to 
the  split  combine  sequence  after  the  initial  assignment  of 
pixels  to  clusters  and  there  is  no  initial  series  of  split 
computations. 

9.  R2  -  The  distance  threshold  to  determine  whether  or  not  a 
data  point  (pixel)  is  assigned  to  the  null  class  after  the 
last  requested  iteration.  All  data  points  whose  distance  to 
the  nearest  cluster  is  greater  than  R2  are  assigned  to  the 
null  class. 

10.  PMAX  -  The  maximum  number  of  iterations  allowed  under  the 
"Initial  Split  Iteration"  option.  If  more  than  P  x  100  per¬ 
cent  of  the  clusters  split,  another  split  computation  takes 
place  after  the  data  points  are  assigned  to  the  new  clusters. 
After  PMAX  such  iterations  control  passes  to  the  input  split/ 
confine  sequence  whether  or  not  P  x  100  percent  of  the  clus¬ 
ters  have  remained  stable. 

11.  PM  -  Last  pass  small  cluster  elimination  threshold.  Clusters 
with  fewer  than  PN  +  (the  number  of  channels)  points  are 
eliminated,  and  a  final  reassignment  of  the  points  in  those 
clusters  to  the  remaining  clusters  or  to  the  null  class  is 
made. 


3-86 


12.  SPLIT/COMBINE  SEQUENCE  -  Determines  whether  an  attempt  is  made 
to  split  or  combine  (merge)  clusters  after  data  points  have 
been  assigned  to  clusters  and  the  cluster  means  and  variances 
recomputed . 

Default  values  of  the  first  eleven  parameters  are  initialized  when  the 
CLUSTER  PM  is  called.  The  values  of  Tl,  T2,  and  R2  are  particularly 
data  dependent,  and  it  is  to  be  expected  that  a  certain  amount  of  ex¬ 
perimentation  with  data  of  interest  will  be  necessary  to  determine 
optimal  values  of  these  parameters.  The  split/combine  sequence  is 
initialized  as  "empty,"  which  means  that  no  splitting  or  combining 
of  clusters  is  attempted,  but  only  the  assignment  of  data  points  to 
clusters. 

In  addition  to  the  clustering  parameters,  starting  vectors  or  cluster 
centers  must  be  specified;  three  options  are  provided.  Self-generated 
starting  vectors  may  be  requested  in  which  case  one  cluster  center  with 
zero  values  in  all  of  the  channels  is  selected.  On  the  first  assignment 
of  data  points  to  clusters  all  of  the  points  are,  of  course,  assigned  to 
the  same  cluster.  But  if  split  iterations  are  specified,  in  most  cases 
the  single  cluster  will  be  split,  and  after  several  iterations  meaning¬ 
ful  clusters  will  develop.  It  has  been  found  in  practice  that  ISODATA 
is  not  very  sensitive  to  the  choice  of  starting  vectors.  A  second  option 
is  to  select  as  starting  vectors  previously  defined  classes.  The  third 
option  uses  the  final  values  of  the  cluster  means  in  the  immediately 
preceding  cluster  computation  as  starting  vectors.  In  effect,  the 
present  computation  picks  up  where  the  previous  one  left  off  and  allows 
split/combine  sequences  of  indefinite  length. 

Before  entering  the  CLUSTER  PM  a  composite  image  (one  having  at  least 
two  channels)  to  be  used  in  clustering  must  have  been  built.  This  is 
done  through  the  INTERL  PM,  which  is  described  in  Section  3.1.  Further, 
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a  file  of  fields  in  that  image  must  have  been  created  using  the  FIELDEF 
PM,  Section  3.2.  Upon  entering  the  CLUSTER  PM,  the  user  must  specify 
the  composite  image  and  select  the  fields.  The  data  points  in  the 
rectangular  hull  of  the  selected  fields  will  be  clustered  in  the  sub¬ 
sequent  computation.  The  values  of  the  clustering  parameters  are 
changed  from  their  default  values,  if  desired,  and  starting  vectors 
specified.  These  two  steps  may  be  done  in  any  order. 

In  the  normal  procedure,  the  "cluster"  option  is  selected  next.  As 
many  lines  (records)  of  image  data  as  will  fit  are  read  from  disk  and 
stored  in  ECS.  Then  a  line  at  a  time  is  read  from  ECS,  unpacked,  and 
stored  in  core.  For  each  pixel  (data  point)  in  the  line,  the  distance 
to  each  existing  cluster  is  computed  according  to  the  metric  specified 
by  IDISF.  The  index  of  the  cluster  closest  to  the  pixel  and  the  dis¬ 
tances  to  that  pixel  are  recorded  and  stored  in  interleaved  fashion  on 
disk  as  the  output  image.  The  output  image  may  be  thought  of  as  a 
two  channel  image  which  corresponds  pixel  by  pixel  with  the  image  to  be 
clustered,  the  first  channel  containing  the  distance  to  the  nearest 
cluster  and  the  second  channel  containing  the  number  of  the  nearest 
cluster.  The  process  continues  until  all  of  the  lines  in  ECS  have  been 
treated.  If  the  image  to  be  clustered  cannot  fit  into  ECS,  the  next 
group  of  image  lines  is  read  into  ECS  up  to  its  capacity,  and  the  proc¬ 
ess  continued  until  all  of  the  pixels  of  the  image  have  been  assigned. 
The  channel  means  and  variances  are  then  computed  for  each  of  the 
clusters  formed. 

The  subsequent  computational  path  depends  on  the  value  of  P  and  the 
split /combine  sequence  specified  by  the  user.  If  P  =  0,  the  split/ 
combine  sequence  is  tested  on  each  iteration  to  determine  whether  a 
split  (S)  or  combine  (C)  iteration  is  specified.  In  a  split  iteration, 
the  cluster  having  the  largest  channel  variance  is  determined.  Two  new 
cluster  centers  are  formed  from  this  cluster  center  by  respectively 
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adding  and  subtracting  SEP  (channel  standard  deviation)  to  the  mean  of 
that  channel,  provided: 


•  The  channel  standard  deviation  is  greater  than  Tl; 

•  The  population  of  the  cluster  under  consideration  is  greater 
than  2*(NMIN  +1); 


•  The  number  of  clusters  after  splitting  would  not  exceed  NVMMAX. 

The  cluster  which  has  just  been  split  is  removed  from  consideration, 
and  the  cluster  having  the  next  largest  channel  variance  determined, 
tested,  and  if  the  tests  are  passed,  split.  The  process  continues 
until  all  candidates  for  splitting  have  been  examined.  In  a  combine 
iteration,  the  distance  between  cluster  centers  is  computed  according 
to  the  unweighted  euclidean  metric  if  ISODAT  =  0  or  the  weighted 
euclidean  metric  if  ISODAT  =  1.  Clusters  closer  together  than  T2  are 
merged  by  forming  a  new  cluster  center  according  to  the  formula 


m 


j  n'  +  n"  Mj 


n'  +  n' 


where  m^  is  the  jth  component  of  the  new  cluster  center  vector,  y ^  '  and 
y^"  are  the  jth  channel  means  of  the  cluster  to  be  merged,  and  n'  and 
n"  are  their  respective  populations.  Following  a  split  or  combine 
computation,  pixels  are  reassigned  to  clusters,  and  new  cluster  channel 
means  and  channel  variances  computed. 


Following  the  initial  assignment  of  pixels  to  clusters,  the  cycle  of 
splitting  or  combining  of  clusters  and  reassignment  continues  until 
the  split/combine  sequence  is  exhausted.  If  the  split/combine 
sequences  has  the  default  values,  i.e.  is  empty,  the  cycle  is  completed 


immediately  after  the  first  assignment  of  pixels  to  clusters.  At  this 
point  the  cluster  populations  are  tested  and  each  cluster  whose  popula¬ 
tion  is  less  than  PN  +  (the  number  of  channels)  is  eliminated,  and  a 
final  assigment  of  pixels  to  clusters  made.  Those  pixels  whose  distance 
to  the  nearest  cluster  center  is  greater  than  R2  are  assigned  to  the 
null  class,  that  is  they  are  not  considered  to  belong  to  any  cluster. 

The  other  principal  computation  path  is  followed  where  P  >  0.  In  that 
case,  following  the  initial  assignment  of  pixels  to  clusters,  a  cycle 
of  iterations  is  carried  out  before  control  is  passed  to  the  input 
split/combine  sequence.  In  each  iteration,  the  ratio  of  clusters 
having  more  than  PN  +  (number  of  channels)  pixels  whose  maximum  channel 
standard  deviation  is  greater  than  Tl,  to  the  total  number  of  clusters 
in  that  class,  is  computed.  If  the  ratio  is  greater  than  P,  a  cluster 
splitting  computation  is  performed  and  pixels  reassigned  to  the  new 
set  of  cluster  centers.  The  iteration  cycle  continues  until  the  ratio 
of  unstable  clusters  is  less  than  P,  or  PMAX  iterations  have  taken 
place,  whichever  condition  occurs  first.  At  this  point  control  is 
passed  to  the  input  split/combine  sequence.  The  initial  split  itera¬ 
tion  option  is  particularly  useful  in  dealing  with  data  for  which  a 
split /combine  sequence  which  can  be  expected  to  produce  useful  results 
has  not  yet  been  developed.  In  such  cases  the  algorithm  allows  the 
formation  of  clusters  to  continue  until  the  situation  stabilizes,  at 
which  time  examination  of  the  results  may  suggest  what  course  of 
action  (further  splitting,  combining,  or  termination  of  the  process) 
to  follow. 

To  assist  the  user  in  deciding  on  further  courses  of  action,  cluster 
means  and  variances  are  available  for  display  on  the  Tektronix  (and 
consequently  for  hard  copying)  after  each  iteration.  After  the  final 
iteration  of  a  sequence,  cluster  populations  before  and  after 
elimination  of  small  clusters  are  available,  as  well  as  the  distance 
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between  the  cluster  means  and  cluster  centers  and  the  minimum  distance 
statistics.  Through  the  "Cluster  Map  Reports"  option  a  pseudocolor 
display  of  the  clustered  image  (the  cluster  map)  can  be  put  up  on  one 
of  the  COMTALS.  A  fixed  assignment  of  pseudocolor  to  cluster  number 
is  built  into  the  subroutine.  Cluster  report  summaries  can  also  be 
displayed  on  the  Tektronix  in  this  option. 

3.4.3  CLUSTER  PDL 

INPUTS  -  USER  SELECTED  CLUSTERING  PARAMETERS 

AND  STARTING  VECTORS  (ENTERED  THROUGH 
TEKTRONIX  TERMINAL) 

-  COMPOSITE  IMAGE  DATA  SET  (FROM  DISK) 

-  FIELD  DATA  SET  (FROM  DISK) 

OUTPUTS  -  CLUSTER  OUTPUT  IMAGE  DATA  SET  (TO  DISK) 

-  CLUSTER  STATISTICS 
CLUSTER : BGNSEGMENT (MAIN) 

TEKDISPLAY  CLUSTER  TITLE  MENU 
TEKDISPLAY  COMPOSITE-SCENE-FILE  LIST 
SELECT  COMPOSITE-SCENE-FILE 

CALL  FETEKFO 

TEKDISPLAY  FIELD  -FILE-NAME  LIST 
SELECT  FIELDS  FOR  CLUSTERING  AND  POSSIBLE  DISPLAY 
CALL  FETEKDF 

DETERMINE  MINIMUM  RECTANGULAR  HULL 
CALL  RECFLD 

READ  COMPOSITE  IMAGE  LABEL  AND  VALIDATE  SUBIMAGE  TO 
BE  CLUSTERED 
IF  IMAGE  DISPLAY  OPTION 
THEN  DISPLAY  IMAGE  TO  BE  CLUSTERED 


CALL  DSPCLA 

CREATE  OUTPUT  IMAGE  FILE 
CALL  KREARI 

DO  WHILE  ANOTHER  OPTION  TO  BE  SELECTED 
TEKDISPLAY  OPTION  MENU 
SELECT  OPTION 

CALL  FETEKSO 

CASENTRY  (MODIFY, STARTING  VECTORS, CLUSTER, MAP, EXIT) 

CASE  1  (MODIFY) 

DO  WHILE  CLUSTERING  PARAMETERS  TO  BE  MODIFIED 
SELECT  OPTION  (ONE  OF  TWELVE  PARAMETERS) 

CALL  FETEKSO 
MODIFY 

ENDDO 

ENDCASE 

CASE  2  (SELECT  STARTING  VECTORS) 

DO  WHILE  STARTING  VECTORS  TO  BE  SELECTED 
SELECT  OPTION  (ZERO, OLD, NEW) 

STARTING  VECTORS  SELECTED  AND  DISPLAYED 

ENDDO 

ENDCASE 

CASE  3  (CLUSTER) 

DO  WHILE  SPLIT/COMBINE  ITERATIONS  REMAIN  TO  BE  PERFORMED 
ASSIGN  PIXELS  TO  CLUSTERS  AND  BUILD  OUTPUT  IMAGE 
DO  WHILE  IMAGE  LINES  REMAIN  TO  TRANSFERRED  TO  ECS 
CALL  FTLECS 

DO  WHILE  IMAGE  LINES  IN  ECS  REMAIN  TO  BE  CLUSTERED 
CALL  READEC 
CALL  UNPKF 
CALL  CLUS2 
CALL  PACKF 
CALL  WRITEC 
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ENDDO 

CALL  EMTECS 

ENDDO 

CALL  D CLOSE 

SPLIT  OR  COMBINE  CLUSTERS 
CALL  CESCA 
IF  LAST  ITERATION 

THEN  ELIMINATE  SMALL  CLUSTERS,  REASSIGN  PIXELS  TO 
REMAINING  CLUSTERS 
ENDDO 
ENDCASE 

CASE  4  (REPORTS, CLUSTER  MAP) 

CALL  RMENU 
ENDCASE 
CASE  5  (EXIT) 

SET  EXIT  FLAG 
ENDCASE 

ENDDO 

CALL  FINIS 
END 


3.4.4  Subroutine  Description 


The  principal  subroutines  called  by  CLUSTER  are  as  follows: 


3. 4. 4.1  Subroutine  CESCA 


Title  -  Cluster  elimination,  splitting,  and  combining  algorithm 


Parameters 


CALL  CESCA  (ECHAR,VMP,VAR,ND,NVM,NMIN,T1,T2,SEP,IS0DAT, 
NVMMAX.IT) 


ICHAR  -  Left  justified  Hollerith  S  or  C  which  determines 
whether  clusters  are  to  be  split  or  combined 

VMP  -  Array  of  cluster  channel  means 

VAR  -  Array  of  cluster  channel  variances 

ND  -  Number  of  channels 

NVM  -  Number  of  clusters 

NVMIN  -  Minimum  cluster  population  parameter  for  cluster 
splitting 

T1  -  Cluster  splitting  channel  standard  deviation 
threshold 

T2  -  Cluster  combining  intercluster  distance 

threshold 

SEP  -  New  cluster  channel  mean  scaling  factor 

ISODAT  -  Selects  combining  algorithm 

NVMMAX  -  Maximum  number  of  clusters  allowed  in  cluster 
splitting 

IT  -  Iteration  number 
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Description 

This  subroutine  performs  cluster  splitting  or  combining,  de¬ 
pending  on  the  value  of  ICHAR.  Cluster  splitting  finds  the 
cluster  having  the  largest  channel  variance  and  splits  the 
cluster  along  that  channel,  providing  certain  other  condi¬ 
tions  are  met.  The  two  new  cluster  centers  are  removed  from 
the  competition  and  the  algorithm  finds  the  cluster  from  the 
remaining  group  having  the  largest  channel  variance.  The 
cycle  continues  until  no  candidate  clusters  remain.  The 
cluster  combining  algorithm  finds  a  pair  of  clusters  whose 
distance  is  less  than  the  threshold.  A  new  cluster  center 
is  formed  from  the  average  of  the  two  weighted  by  their  re¬ 
spective  populations.  The  merged  clusters  are  removed  from 
the  competition  and  the  cycle  repeated  with  the  remaining 
clusters  until  no  candidate  clusters  remain. 


3.4.4. 2  Subroutine  CLUS2 


Title  -  Assigns  pixels  to  clusters  a  line  at  a  time 


Parameters 

CALL  CLUS2(IT, ITL,IDISF,R2,NVLL) 

COMMON /RES IMG /NAMER , NBITR , NWP  SR , NL V , NP IXC 

COMMON /PROCP 1EM/K0PTP , NCHAN , NCLUS , NSLC , NELC , NSPC , LEPC , NWECS 

COMMON/STATS / VM, VMP , VAR , NVG , RMINM , RMINV , VSUM1 , VSUM2 

IT  -  Iteration  number 

ITL  -  Total  number  of  iterations 


IDISF  -  Selects  distance  metric 
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R2  -  Last  iteration  distance  threshold 


I  r 


I 


1 


NULL  -  Null  class  population 
NPIXC  -  Number  of  pixels  in  a  line 

NCHAN  -  Number  of  channels 

NCLUS  -  Number  of  clusters 

VM  -  Array  of  cluster  centers  (NCHAN*NCLUS  components) 

NVG  -  Array  of  cluster  populations 

RMINM  -  Array  of  means  of  distance  to  nearest  cluster 

RMINV  -  Array  of  variances  of  distance  to  nearest  cluster 

VSUM1  -  Working  array  to  accumulate  channel  sums 

VSUM2  -  Working  array  to  accumulate  channel  sums  of 

squares. 

(Other  COMMON  variables  not  required  in  this  subroutine.) 

Description 

For  a  line  of  image  data  CLUS2  computes  the  distance  of  each 
pixel  in  turn  to  each  of  the  clusters  and  enters  the  minimum  distance 
and  cluster  number  for  which  it  is  attained  in  the  output  image  array. 
Minimum  distance  statistics  are  accumulated  and  running  sums  of  channel 
means  and  variances  computed. 


3-96 


3.4.4. 3  Subroutine  DSPCLA 


For  a  description,  see  Section  3. 5. 4. 3 


3. 4. 4. 4  Subroutine  EMTECS 


For  a  description,  see  Section  3. 5. 4. 5 


3. 4. 4. 5  Subroutine  FETEKDF 


For  a  description,  see  Section  3. 2.4. 5 


3. 4. 4. 6  Subroutine  FETEKFC 


For  a  description,  see  Section  3. 2.4. 7 


3.4.4. 7  Subroutine  FEUTLFC 


For  a  description,  see  Section  3.2.4.12 


3. 4. 4. 8  Subroutine  FILECS 


For  a  description,  see  Section  3. 5.4.6. 


3. 4. 4. 9  Subroutine  KREARI 

For  a  description,  see  Section  3. 5. 4. 7. 


3.4.4.10  Sub rout ine  PACK 

Title  -  Array  packing  routine. 

Parameters 

CALL  PACK(V,ND,NV,IND) 

V  -  On  input,  array  to  be  packed;  on  output,  packed 
array. 

ND  -  Dimension  of  vectors  in  array  V. 

NV  -  Number  of  vectors  in  array  V. 

IND  -  Index  of  vector  to  be  removed. 

Description 

PACK  forms  an  array  of  ND*(NV-1)  elements  in  which  all  elements 
of  the  original  array  whose  index  is  greater  than  IND*  ND  are 
moved  up  ND  positions. 


3.4.4.11  Subroutine  RECFLD 


For  a  description,  see  Section  3.5.4.11. 


3.5  MAXLIK  PROCESSING  MODULE 

The  maximum  likelihood  (MAXLIK)  processing  module  (PM)  assigns  each 
observation  vector  in  a  subimage  to  the  class  with  the  smallest  value 
of  the  likelihood  function.  Results  of  the  classification  can  be 
displayed  on  the  COMTAL,  Tektronix  or  printed  on  the  line  printer. 


3.5.1  MAXLIK  User  Information 

Before  entering  MAXLIK,  the  user  must  have  defined  fields  (see  Section 
3.2  FIELDEF),  defined  classes  (see  Section  3.3  CLASTAT)  and  have  a 
composite  image  on  the  assigned  disk  pack  (see  Section  3.1  INTERL) . 
After  completing  the  above  MAXLIK  can  be  initialized  in  one  of  two 
ways.  First  by  just  typing  in  "MAXLIK"  and  second,  if  the  composite 
image  name  and  field/class  data  set  name  is  known,  by  typing  in 
"MAXLIK,  Image  Name,  Field/Class  Data  Set  Name." 

If  the  first  way  were  used  the  PM  would  request  the  user  to  select  a 
composite  image  and  field/class  data  set  from  the  library  on  the  disk 
pack.  Once  MAXLIK  has  been  initiated  and  the  data  sets  attached, 
the  user  will  be  presented  with  the  following  requests  and  menus. 

Field  Selection 

The  fields  to  be  classified  are  selected  at  this  time,  all 
pixels  interior  to  the  minimum  rectangle  enclosing  the  fields 
will  be  classified,  and  the  population  of  each  class  will  be 
calculated  for  each  field  selected.  The  selection  procedure 
begins  with  the  menu  given  below. 
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Select  Option 

1.  Select  Field (s)  Fields  to  classified — 25  Max — 

2.  Drop  Field (s)  From  the  selected  list 

3.  Field  Selection  completed 

(Enter  a  value  from  1  to  3)  1 

Where  the  response  should  be  a  "1." 

The  second  menu  presented  contains  a  description  of  the  fields,  an 
example  of  which  is  given  in  Figure  3.5.1.  The  required  response  is 
the  numbers  of  the  fields  wanted.  When  all  the  fields  have  been 
selected  and  reviewed,  the  select  option  menu  will  appear  and  a 
response  of  "3"  will  terminate  the  field  selection  process. 

Display  of  Subimage  to  be  Classified 

This  request  determines  if  one  band/channel  of  the  subimage  that 
is  to  be  classified  should  be  displayed.  The  menu  is  given  below. 
Do  you  want  to  display  image  to  be  classified  (Y/N)?  Y 
Enter  input  image  display  name  B1 
Enter  channel  number  1  thru  3  to  be  displayed  3 
The  channel  number  n  refers  to  nth  image  that  was  used  in  building 
the  composite  image. 

Class  Selection 

The  classes  into  which  every  pixel  in  the  subimage  will  be 
assigned  are  now  selected.  The  procedure  begins  with 

SELECT  OPTION 

1.  Select  Class (es)  for  classification — 10  Max — 

2.  Drop  Class (es)  from  the  selected  list 

3.  Class  selection  completed 

(Enter  a  value  from  1  to  3)  1 

Where  the  response  should  be  a  "1." 
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Figure  3.5-1  Field  Description 


Next  a  description  of  the  available  classes  is  presented,  an  example 
of  which  is  given  in  Figure  3.5.2.  The  class  selection  termination 
proceeds  as  that  of  field  selection. 

Selection  of  Class  Parameters 

Three  parameters  are  selected  for  each  class.  They  are  color, 
apriori  weight  and  character:  color  selection  is  required  and 
the  others  are  optional.  Color  is  obtained  by  first  presenting 
a  set  of  pseudocolor  tables  and  asking  the  user  to  select  one. 
This  table  is  displayed  and  the  user  is  requested  to  position 
the  cursor  over  the  color  that  is  to  represent  the  class  and 
to  press  the  selection  done  button.  This  is  done  for  all  classes 
plus  the  null  class.  See  Figure  3.5.3  for  an  example  of  the  menu 
and  the  responses.  For  the  apriori  weight  and  character  the 
class  name  is  displayed  with  a  request  for  a  weight  or  character. 
A  typical  case  is  given  below  where  apriori  weights  were  not 
wanted  but  characters  were  wanted. 

SELECTION  OF  CLASS  PARAMETERS 

Do  you  want  class  apriori  weights  (Y/N)?  N 

Do  you  want  to  assign  a  character  to  a  class  (Y/N)?  Y 


Enter  character  for-treesky  S 
Enter  character  for-treeroad  R 
Enter  character  for-treegrass  G 
Enter  character  for-null  class  N 


Processing  Parameters 

The  next  set  of  requests  are  for  how  the  data  is  to  be  processed 
and  for  describing  the  class  map  image  that  will  be  created. 
Classically,  the  kernel  of  the  maximum  likelihood  classifier 
uses  the  class  covariance  matrix,  but  to  reduce  computation 
time  approximations  to  the  covariance  matrix  have  been  used. 
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Figure  3.5-2  Class  Prescription 
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Figure  3.5-3  Class  Pseudocolor 


There  are  three  approximations  to  the  covariance  matrix  incor¬ 
porated  in  MAXLIK,  any  of  which  can  be  selected.  Processing 
modes  describe  where  and  when  the  processing  will  be  done.  Only 
mode  number  1  is  currently  available.  The  name  for  the  output 
class  map  and  the  A/N  specification  conform  to  the  DIAL  standards. 
See  Figure  3.5-4  for  a  typical  menu  and  responses.  At  this  point 
the  actual  maximum  likelihood  classification  takes  place. 

Presentation  of  Results 

When  the  classification  process  has  been  completed,  the  results 
menu  given  in  Figure  3.5-5  will  be  displayed.  The  user  then 
enters  the  number  of  the  option  he  wants  and  those  results  are 
presented.  After  the  option  selected  has  completed  its  task, 
the  results  menu  is  redisplayed  until  a  9,  "Terminate  Results 
Processing,"  is  entered. 

1.  LIST  OVERALL  SUB IMAGE  RESULTS 

MLC  results  for  the  subimage  are  displayed  (see  Figure 
3.5-6  for  an  example).  The  sub image  coordinates  are 
with  respect  to  the  composite  image.  The  population 
numbers  are  the  number  of  pixels  in  the  subimage  assigned 
to  the  class.  The  means  and  sigmas  are  means  and  stand¬ 
ard  deviations  of  all  the  pixels  in  that  class.  This 
example  was  for  a  composite  image  having  three  channels 
and  three  classes. 

2.  DISPLAY  CLASS  MAP 

This  option  displays  the  class  map  image.  Each  pixel 
is  displayed  in  the  color  of  the  class  that  it  was 
assigned  to.  This  can  also  include  the  null  class  if 
thresholding  has  been  performed.  The  user  is  also 
asked  if  the  color/class  assignment  is  to  be  reviewed. 
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Figure  3.5-4  Processing  Parameters 


NAXLIK  ie;  oi  ":  us.fs.rc. 
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AVAILABLE  pesults  options  are 


1  LIST  OUERALL  SuBIHAGE  RESULTS 

2  DISPLAY  CLASS  NAP 

3  ANNOTATE  RUN  RESULTS 

4  THRESHOLD  RESULT  DISTANCE  NETRICS 

5  ASSIGN  CHARACTER  TO  CLASS 

6  DISPLAV  COMPOSITE  SUBINAGE 

7  LIST  FIELD  RESULTS 

8  LIST  A  SUBAREA  OF  CLASS  NAP 

9  TERMINATE  RESULTS  PROCESSING 

ENTER  OPTION  NUMBER 


Figure  3.5-5  Results  Options 


Figure  3.5-6  MLC  Sub image  Results 
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1 

l 

1 
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If  yes,  Chen  the  class  name  and  the  color  assigned  are 
displayed  on  the  COMTAL. 

3.  ANNOTATE  RUN  RESULTS 

Up  to  eighty  characters  of  run  annotation  data  can  be 
entered  and  this  data  will  be  displayed  and  printed  with 
all  subsequent  results. 

4 .  THRESHOLDING 

Thresholding  is  performed  by  calculating  the  cumulative 
distribution  of  the  distance  metrics  and  requesting  the 
user  to  enter  the  percent  of  the  distribution  to  be 
assigned  to  the  null  class.  All  pixels  that  exceed  the 
threshold  determined  by  the  above  information  are 
assigned  to  the  null  class  and  a  new  class  map  is  created 
having  the  same  name  as  the  prior  class  map.  The  re¬ 
quests  and  results  of  a  computation  are  presented  in 
Figure  3.5-7.  In  this  case  the  request  was  to  assign 
51%  of  the  pixels  to  the  null  class.  This  resulted  in 
having  50.93%  of  the  pixels  assigned  to  the  null  class 
and  the  other  pixels  assigned  as  follows:  23.3%  to  the 
first  class,  7.2%  to  the  second  class,  18.6%  to  the  third 
class.  The  percent  is  with  respect  to  the  subimage. 

5.  ASSIGN  CHARACTER  TO  CLASS 

The  class  name  is  presented  to  the  user  with  a  request 
for  a  character  to  represent  that  class.  Each  class 
requires  a  unique  character  and  if  a  duplicate  character 
is  entered,  the  program  will  request  another  character 
for  that  class.  The  characters  are  used  when  Option  8 
is  selected. 
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Figure  3.5-7  Thresholding 


6.  DISPLAY  OF  COMPOSITE  SUBIMAGE 

If  this  option  is  selected,  one  channel  of  the  composite 
subimage  image  is  displayed.  The  same  procedure  as  given 
under  Section  3. 5. IB  is  used. 

7.  LIST  FIELD  RESULTS 

Field  results  are  the  number  of  pixels  and  percent  of 
the  field  assigned  to  each  class.  These  results  are 
displayed  on  the  Tektronix  for  every  field  selected 
during  the  field  selection  process.  An  example  of  a 
field  results  display  is  given  in  Figure  3.5-8.  After 
the  title,  the  first  line  gives  the  composite  image  name, 
the  second  line  gives  run  annotation  information,  and 
the  third  line  gives  the  threshold  value.  After  this 
individual  field  results  are  displayed  and  in  this 
example  there  are  three  fields,  TREESKY,  TREEROAD,  and 
TREEGRASS,  and  four  classes;  namely,  TREESKY,  TREEROAD, 
TREEGRASS  and  NULL.  The  names  of  classes  and  fields 
are  selected  by  the  user.  Also  the  user  can  have  the 
field  results  sent  to  the  printer. 

8.  LIST  A  SUBAREA  OF  CLASS  MAP 

This  option  gives  the  user  the  capability  to  display  on 
the  Tektronix  and  send  to  the  printer  a  100  pixel  by  40 
line  area  of  class  assignments.  This  option  works  in 
conjunction  with  Option  5  where  a  character  was  assigned 
to  each  class.  The  character  which  was  assigned  to  the 
class  is  displayed  for  every  pixel  classified  into  that 
class.  The  subarea  is  determined  by  having  the  user 
position  the  cursor  over  the  center  of  the  area  of 
interest  and  depressing  the  "select"  or  "done"  button 
which  defines  the  center  of  a  100  pixel  by  40  line 
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Figure  3.5-8  Field  Results 


rectangle.  As  an  example,  consider  Figure  3.5-9  where 
the  header  information  is  self  explanatory.  The  sub image 
coordinates  are  given  in  the  header  data  and  the  line 
numbers  (350-389)  and  pixel  numbers  (168-258)  are  given 
with  respect  to  the  subimage  and  not  the  composite  image. 

9.  TERMINATE  RESULTS  PROCESSING 

Selecting  this  option  will  terminate  the  MAXLIK 
processing  module. 


3.5.2  MAXLIK  Engineering  Description 

The  MAXLIK  processing  module  embodies  the  maximum  likelihood  and  Bayes 
classification  algorithms.  These  supervised,  parametric  classifica¬ 
tion  techniques  require  class  mean  vectors  (u^)  and  covariance  matrices 
(1^),  and  in  addition  the  user  may  specify  a  set  of  probabilities  which 
represent  prior  knowledge  of  the  relative  frequencies  of  the  different 
classes. 

An  observation  x  is  classified  into  the  class  with  the  smallest  value  of 
Lfe(x)  -  log  | Efc l  +  (x  -  yk)T  Ek  1  (x  -  uk)  -  2  log  pfc  (1) 

where 

j  Ek )  •  determinant  of  covariance  matrix  for  kth  class 

yk  ■  mean  vector  for  kth  class 

I,  ■  covariance  matrix  for  kth  class 
k 

Pk  ■  apriori  probability 
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OR  T0  RETURN  to  RESULTS  HENII  ENTER 


This  is  the  most  widely  used  supervised  parametric  algorithm,  mainly 
because  it  maximizes  the  proportion  of  correctly  assigned  observations 
of  data  from  multivariate  normal  distributions. 

When  prior  knowledge  of  relative  frequencies  of  the  classes  is  not 

available,  MAXLIK  employs  a  maximum  likelihood  classifier.  It  has 

the  same  form  as  the  Bayes  classifier,  equation  (1),  except  that  the 

term  -2  log  p^  is  not  present.  This  classifier  is  optimal  if  all  of 

the  prior  probabilities  are  equal.  To  reduce  the  computation  time 

necessary  to  evaluate  equation  (1) ,  the  user  can  select  approximations 

to  the  covariance  matrix.  These  approximations  replace  (1^)  by  (1) 

the  corresponding  diagonal  matrix,  (2)  a  constant  times  the  identity 
2 

matrix  a  I,  and  (3)  by  the  identity  matrix.  These  approximations 
reduce  the  number  of  operations  per  class  per  pixel  from  a  quadrutic 
to  a  linear  function  of  the  dimension  of  the  observation  vector. 

A  class  map  image  is  the  primary  output  of  MAXLIK.  It  is  a  two 
dimensional  image,  each  pixel  is  represented  by  two  8  bit  quantities. 

The  first  8  bits  is  used  for  the  quadratic  term  in  the  likelihood 
function 

(x  -  (x  -  uk) 

while  the  other  eight  are  for  the  class  number.  This  is  the  vehicle 
used  for  calculating  field  results,  thresholding  and  displaying  results. 

3.5.3  MAXLIK  Control  Flow 

This  section  presents  the  control  flow  and  processing  steps  of  MAXLIK. 
The  steps  are  described  in  general  forms  and  the  major  subroutines 
identified.  These  subroutines  are  described  in  the  next  section. 
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CONTROL  FLOW/PROCESSING  STEP 

SUBROUTINES  USED 

1. 

Initialize  MAXLIK  processing 

2. 

Request  composite  image 

3. 

Request  fields  to  be  classified 

FETEKDF 

4. 

Determine  rectangular  subimage  encompassing 

all  fields 

RECFLD 

5. 

Validate  sub image 

6. 

Display  channel  of  subimage 

DSPCLA 

7. 

Calculate  distance  scale  factor 

8. 

Request  classes  for  classification 

FETEKDF 

9. 

Request  parameters  for  classes 

CLASDA 

10. 

List  processing  options 

11. 

Calculate  and  reformat  class  signatures 

SIGPREP 

12. 

List  processing  modes 

13. 

Start  MLC  processing  of  subimage 

14. 

Fill  ECS  with  set  of  image  lines 

FILECS 

15. 

Get  a  line  from  ECS 

16. 

Unpack  line 

17. 

Perform  MLC  computation  for  each  pixel  in  the 

line 

18. 

Scale  distance  results 

19. 

Pack  results  image  line 

20. 

Write  results  image  line  to  ECS 

21. 

Write  results  image  lines  in  ECS  to  disk 

EMTECS 

22. 

Return  to  14  if  all  lines  in  subimage  have 

not  been  processed  or  continue  at  23. 

23. 

Display  results  menu 

RMENU 

24. 

List  overall  subimage  results 

RES SUB 

25. 

Display  classification  map 

DSPMAP 

26. 

Annotate  classification  run  results 

27. 

Threshold  result  image  distance  metrics 

THRESD 

28. 

Assign  character  to  class 

ASSCHR 
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29. 

Display  channel  of  subimage 

DSPCLA 

30. 

Display  on  Tektronix  field  results 

FLDRES 

FEUTLLA 

31. 

Display  on  Tektronix  a  subarea  of  class  map 

LSTMAP 

32. 

Terminate  MAXLIK  processing 

3.5.4  MAXLIK  Subroutine  Description 

All  of  the  subroutines  developed  for  MAXLIK  are  described  in  this 
section  with  the  exception  of  FETEKDF  and  FEUTLLA  which  are  described 
in  Section  3.2.4. 

3. 5. 4.1  Subroutine  ASSCHR 

Purpose 

Assign  a  character  to  a  particular  class  or  cluster 


Usage 

CALL  ASSCHR (NCLASS , CNAME , CLASCH) 

Description  of  Parameters 

NCLASS  -  Integer  parameter  number  of  classes/clusters 

CNAME  -  Array  names  of  classes,  each  name  4  words  stored 
modulo  4 

CLASCH  -  Array  class  characters  one  word  for  each  class/ 
cluster  stored  in  display  code,  left  justified. 

Common  Areas  Used 
None 
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3. 5.4. 2  Subroutine  CLASDA 


I 

I 

I 

l 

I 


1 

I 

I 


Purpose 

Associates  a  pseudocolor,  a  weight  and  a  character  with 
each  class. 


Usage 

CALL  CLASDA  ( IREC , NCLS , IERROR) 

Description  of  Parameters 

IREC  -  An  input  array  containing  the  class  record  numbers 

NCLS  -  Input  variable  containing  the  number  of  classes 

IERROR  -  Output  error  parameter 
=0  no  errors 
=1  label  error 

=2  cannot  find  record  requested 
=3  DREAD  error 

Subroutines  Used 

CURSDEF , CURSXY , DREAD , DSPLAY , DSPNME , GALPHA , GSET , 

IMGECS , IMGLINE , LBLRD , LNXYPE, PACKF , PACKI , PSEUDO , 

SETCORE , TEKMSG , TEKRD , TERWRIT , T ITLE , WRITEC 

Method 

Pseudocolors  are  associated  with  classes  by  first  generating 
an  image  containing  all  64  values,  one  per  PC  table  entry. 

PC  tables  are  used  to  display  this  image.  When  a  PC  table 
has  been  selected,  the  user  is  requested  to  position  the 
cursor  over  the  color  wanted  for  a  particular  class.  As 
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colors  are  selected,  they  are  displayed  in  the  order  of 
selection.  Characters  and  weights  are  entered  via  the 
Tektronix. 


Common  Areas  Used 

CLASS,  ECS,  LINE,  PROGRRM,  RES IMG 


3.5.4. 3  Subroutine  DSPCLA 

Purpose 

Display  sub image  that  is  to  be  classified 


Usage 

CALL  DSPCLA  (A, B,  IERROR.NAMEC ,NBITC ,NCHAN,NWDSC ,NSPC , 
NSLC , NP IXC , NLO ) 

Description  of  Parameters 

A  -  Buffer  used  for  temporary  storage  must  be  large 

enough  to  hold  packed  composite  image  line 

B  -  Buffer  used  for  temporary  storage  (same  size  as 

buffer  A) 

NAMEC  -  Four  word  array  having  composite  image  name 

NBITC  -  Integer  parameter  -  number  of  bits/pixel 

NCHAN  -  Integer  parameter  -  number  of  channels  In  image 

NWDSC  -  Integer  parameter  -  number  of  60  bit  wds/record 


Integer  parameter  -  starting  pixel  number  of  subimage 


NSPC 


Integer  parameter  -  starting  line  number  of  subimage 


NSLC 


NPIXC 


IERROR  -  Output  error  parameter 
■0  No  errors 
■I  Find  error 


2  DREAD  error 


Method 


Based  upon  the  channel  selected,  the  composite  image  is 
sampled  and  displayed 


COMMON  Areas  Used 


3. 5. 4. 4  Subroutine  DSPMAP 


Display  class  map  and  class  color  association 


Usage 


CALL  DSPMAP  (NAME, NACLAS, A, NUMCL, IERROR) 


Description  of  Parameters 

NAME  -  Input  array  with  image  map  name 


NACLAS  -  Input  array  with  class  names  each  name  is  4  words  in 
length 


Input  array  used  for  label  buffer 


NUMCL  -  An  integer  value  specifying  the  number  of  classes 


IERRQR  -  Output  error  parameter 
■0  No  errors 
■1  error  in  label 


2  error  in  DREAD 


3  error  in  number  of  lines 


Method 


The  image  generated  by  CLUSTER  or  MAXLIK  is  displayed  with  the 
function  memory  on  and  the  pseudocolor  memory  on.  Before  the 
image  is  sent  to  the  COMTAL  the  image  is  sampled  to  remove  the 
chi**2  value  and  adjusted  for  the  image  size.  Also  upon 
request  the  class  color  correspondence  is  displayed. 


COMMON  Areas  Used 


3. 5. 4. 5  Subroutine  EMTECS 


Take  the  packed  chi**2  and  class  map  lines  from  ECS  and  write 
them  to  disk 


CALL  EMTECS  (A.LINECS) 


Description  of  Parameters 

A  -  Array  used  for  temporary  storage,  must  be  NWDSQ 

words  long 

LINECS  -  Input  number  of  lines  to  be  written  to  disk. 

Remarks 

Packed  lines  in  ECS  have  interspersed  values:  first  chi**2  then 
the  class  number. 

COMMON  Areas  Used 
RES IMG, ECS 

3. 5. 4. 6  Subroutine  FILECS 

Purpose 

Fill  ECS  with  lines  from  the  composite  image 

Usage 

CALL  FILECS  (LBC, LINECS, IERROR) 

Description  of  Parameters 

LBC  -  Input  line  being  classified  relative  to  subimage 

LINECS  -  Output  parameter  giving  number  of  lines  stored  in  ECS 

IERROR  -  error  parameter 
-0  no  errors 
-3  FIND  error 
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Remarks 


Packed  lines  from  the  composite  image  on  disk  are  stored  in 
ECS.  The  number  of  words  from  the  first  word  of  one  line  to 
the  first  word  of  the  next  line  is  the  parameter  IECSS. 


COMMON  Areas  Used 

ECS , COMIMG , PROCPRM , RESIMG , LINE . 


3. 5. 4. 7  Subroutine  KREARI 

Purpose 

Create  the  output  class  map  image  and  store  appropriate 
parameters  in  the  label 


Usage 

CALL  KREARI  (IFLAGC) 

Description  of  Parameters 

IFLAGC  -  Flag  having  the  following  meaning 
»1  MLC  processing 
■2  CLUSTER  processing 


Method 

Generate  the  standard  label  and  move  data  from  COMMON  and 
store  it  in  the  photogramme trie  parameter  record.  Also  a 
function  memory  and  pseudocolor  memory  are  stored. 

COMMON  Areas  Used 


CLASS,  COMIMG,  FIELDS,  LINE,  PROCPRM,  RESIMG 


3.S.4.8  Subroutine  FLORES 


Purpose 

Present  Cluster/Class  results  on  a  per  field  bases 

Usage 

CALL  FLORES  (NAMEFC , NFLDS , FLDNAM , NAMER , NAMEC , IANNO , CNAME , 

N CLASS , DTHRES , IDOR, IFLAGC , IBUF 1 , IBUF2) 

Description  of  Parameters 

NAMEFC  -  4  word  array  -  name  of  field/class  file 

NFLDS  -  Integer  parameter  -  number  of  fields 

FLDNAM  -  Array  names  of  fields  each  name  4  words  stored  modulo  4 

NAMER  -  4  word  array  -  name  of  results  image 

NAMEC  -  4  word  array  -  name  of  composite  image 

IANNO  -  Array  of  10  words  describing  run 

CNAME  -  Array  -  names  of  classes,  each  name  4  words  stored 
modulo  4 

NCLASS  -  Integer  parameter  -  number  of  classes/clusters 
DTHES  -  Fit.  pt.  parameter  -  percent  of  pixels  thresholded 
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ICOR  -  4  word  array  sub image  definition 
ICORR(l)  ■  lowest  pixel  number 
IC0RR(2)  ■  highest  pixel  number 
IC0RR(3)  ■  lowest  line  number 
IC0RR(4)  ■  highest  line  number 

IFLAGC  -  Processing  flag 
-1  MAXLIK 
-2  CLUSTER 

IBUF1  -  Array  used  as  a  buffer  (5K  words) 

IBUF2  -  Array  used  as  a  buffer  (IK  words) 

Method 

This  routine  reads  the  file  (NAMEFC)  to  get  field  description. 
It  then  reads  the  results  image  to  determine  the  population  of 
each  class  for  the  field.  These  results  are  displayed  on  the 
Tektroniz  along  with  field  outlines  on  the  COMTAL. 

COMMON  Areas  Used 
ECS 


3. 5. 4. 9  Subroutine  LSTMAP 

Purpose 

Displays  on  Tektronix  a  100  pixel  by  40  line  area  of  class 
assignments 

Usage 

CALL  LSTMAP  (NAMEC.NAMER,  IANNO,  CLASCH,IC0R,0THRES,IBUP1) 
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Description  of  Parameters 

NAMEC  -  4  word  array  -  name  of  composite  image 

NAMER  -  4  word  array  -  name  of  results  Image 

IANNO  -  Array  of  10  words  describing  run 

CLASCH  -  Array  -  class  characters,  one  word  for  each  class/ 
cluster,  stored  in  display  code  left  justified. 

ICOR  -  4  word  array  -  subimage  definition 
ICORR(l)  *  lowest  pixel  number 
IC0RR(2)  -  highest  pixel  number 
IC0RR(3)  ■  lowest  line  number 
IC0RR(4)  *  highest  line  number 

DTHES  -  Fit.  pt.  parameter  -  percent  of  pixels  thresholded 

Method 

The  user  is  asked  to  position  the  cursor  over  the  center  of 
the  area  where  detailed  pixel  assignment  is  to  be  displayed  on 
the  Tektronix  and  printed. 

COMMON  Areas  Used 
None 


3.3.4.10  Subroutine  MLC2 

Purpose 

Performs  maximum  likelihood  classification  on  one  image  a  line 
at  a  time 
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Usage 


CALL  MLC2 

Description  of  Parameters 

ALINE  -  Input  array  of  pixel  values,  floating  point  format. 

Upon  exiting  array  contains  chi**2  value  and  chosen 
class  number.  They  are  interwoven  starting  with 
chi**2  value. 

Remarks 

The  upper  triangular  part  of  the  matrix  is  assumed  to  be  stored 
row-wise  in  N*  (N+l)/2  successive  storage  locations.  The  second 
class  matrix  is  stored  similarly  starting  in  the  next  location. 
Mean  vectors  are  also  stored  successively.  The  procedure  can 
be  used  for  NCHAN  .GT.  2  but  .LT.  25.  and  NCLASS  .LE.  10. 

Subroutines  Used 
None 

Method 

Classification  is  done  by  maximum  likelihood.  Based  upon  option 
selected  the  full  covariance  matrix  is  used,  or  the  diagonal,  or 
the  expected  value  of  the  trace  times  the  identity,  or  just  the 
identity  matrix 

COMMON  Areas  Used 


RESIMG,  LINE,  PROCPRM,  STATS 


3.5.4.11  Subroutine  RECFLD 


Determine  the  minimum  rectangular  subimage  encompassing  all  the 
fields  to  be  classified 


Usage 


CALL  RECFLD  (IRECS .NAME, ICORR, TERROR) 


Description  of  Parameters 

IRECS  -  An  input  array  containing  the  record  number  of  the 
fields  to  be  classified 


Output  array  containing  the  sub image  coordinates 

ICORR(l)  -  min  line 

IC0RR(2)  ■  min  pixel 

IC0RR(3)  "  max  line 

IC0RR(4)  *  max  pixel 


ICORR 


IERROR  -  Output  error  parameter 
•0  no  errors 
■1  label  error 
-2  DREAD  error 


Method 


All  requested  field  records  are  retrieved  from  the  data  set  and 
the  minimum  and  maximum  vertices  are  selected 


COMMON  Areas  Used 


FIELDS,  LINE 


3.5.4.12  Subroutine  RMENU 

Purpose 

Displays  the  available  results  options 

Usage 

CALL  RMENU  (NAMER , NAMEC , NAMEFC , NFLOS , FLONAM, NCLASS , CNAME , 
CLASCH, ICOR,NCHAN,FMEAN,FSD,IPOP,  IFLAGC, 
NBITC , HPIXT , IBUF 1 , IB JF2 , IBUF2 ) 

Description  of  Parameters 

NAMER  -  4  word  array  -  name  of  results  image 

NAMEC  -  4  word  array  -  name  of  composite  image 

NAMEFC  -  4  word  array  -  name  of  field/class  file 

NFLDS  -  Integer  parameter  -  number  of  fields 

FLDNAM  -  Array  names  of  fields,  each  name  4  words  stored 
modulo  4 

NCLASS  -  Integer  parameter  -  number  of  classes/clusters 

CNAME  -  Array  names  of  classes,  each  name  4  words  stored 
modulo  4 

CLASCH  -  Array  class  characters,  one  word  for  each  Class/ 
Cluster,  stored  in  display  code  left  justified 
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ICOR 

NCHAN 

FMEAN 

FSD 

IPOP 

IFLAGC 

NBITC 

NPIXT 

IBUF1 

IBUF2 


4  word  array  subimage  definition 
ICORR(l)  *  lowest  pixel  number 
IC0RR(2)  *  highest  pixel  number 
IC0RR(3)  *  lowest  line  number 
IC0RR(4)  ■  highest  line  number 

Integer  parameter  -  number  of  channels 

Floating  point  array  -  Cluster/Class  means,  NCHAN 
values  per  Cluster/Class  stored  modulo  NCHAN 

Floating  point  array  -  Cluster/Class  standard  deviations, 
stored  like  FMEAN. 

Array:  Class/Cluster  population  per  sub image 

Processing  flag 
-1  MAXLIK 
-2  CLUSTER 


Integer  parameter  number  of  bits/pixel  in  composite 
image 

Integer  parameter  number  of  pixels  per  line  of  com¬ 
posite  image 

Array  used  as  a  buffer  (5K  words) 

Array  used  as  a  buffer  (IK  words) 


IBUF3  -  Array  used  as  a  buffer  (IK  words) 


Remarks 

This  routine  asks  the  user  for  the  type  of  output  wanted  and 
calls  the  appropriate  subroutines 

COMMON  Areas  Used 
None 

3.5.4.13  Subroutine  RESSUB 

Purpose 

Displays  on  the  Tektronix  the  overall  subimage  results 

Usage 

CALL  RESSUB  (NAMEC , NAMER , I ANNO , CLASCH , CNAME , ICOR , FMEAN ,  F  SD , 
IPOP , NCHAN , IFLAGC , NCLASS ) 

Description  of  Parameters 

NAMEC  -  4  word  array  -  name  of  composite  image 

NAMER  -  4  word  array  -  name  of  results  image 

IANNO  -  Array  of  10  words  describing  run 

CLASCH  -  Array  of  class  characters,  one  word  for  each  Class/ 
Cluster,  stored  in  display  code  left  justified 

CNAME  -  Array  -  names  of  classes,  each  name  4  words  stored 
modulo  4 
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ICOR  -  4  word  array  -  sub  image  definition 
ICORR(l)  ■  lowest  pixel  number 
IC0RR(2)  »  highest  pixel  number 

IC0RR(3)  =  lowest  line  number  \ 

IC0RR(4)  *  highest  line  number 

FMEAN  -  Floating  point  array  -  cluster/class  means,  chan 
values  per  cluster/class,  stored  modulo  NCHAN 

FSD  -  Floating  point  array  -  cluster/class  standard 
deviations  stored  like  FMEAN 

IPOP  -  Array  -  class/cluster  population  per  subimage 

NCHAN  -  Integer  parameter  -  number  of  channels 

IFLAGC  -  Processing  flag 
-1  MAXLIK 
*2  CLUSTER 

NCLASS  -  Integer  parameter  -  number  of  classes/clusters 

Remarks 

This  routine  displays  on  the  Tektronix  class  population  values 
and  statistics  of  all  the  pixel  vectors  in  the  subimage  assigned 
to  the  class. 

COMMON  Areas  Used 
None 
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3.5.4.14  Subroutine  SIGPREP 


Purpose 

Retrieves  class  signatures  and  prepares  them  for  MLC  computation 


Usage 

CALL  SIGPREP  (IPRINT, IERROR) 

Description  of  Parameters 

A  -An  array  used  for  temporary  storage 

IPRINI  -  Input  parameter  indicating  if  signatures  should  be 
printed 

=>0  do  not  print 
*1  print  signatures 

IERROR  -  Output  error  parameter 
*0  no  errors 
*1  label  error 
=2  DREAD  error 

*3  number  of  channels  between  classes  not  consistent 

■4  matrix  inversion  problem 

-5  not  all  signatures  are  available 


Remarks 

The  class  signatures  which  are  stored  on  disk  in  a  name  param¬ 
eter  data  set  are  retrieved  and  for  each  class  the  determinant 
and  the  inverse  of  the  covariance  matrix  are  calculated.  The 
off  diagonal  elements  of  the  inverse  are  modified  to  increase 
computational  speed  and  only  the  upper  triangular  portion  of 
the  matrix  is  stored.  How  che  results  are  stored  in  main 
memory  is  based  upon  the  processing  option. 
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COMMON  Areas  Used 

CLASS,  LINZ,  PROCPRM,  STATS 

3.5.4.15  Subroutine  STDCAL 

Purpose 

Calculates  the  class  mean  and  standard  deviation 

Usage 

CALL  STDCAL  (IPOP,FMEAN,FSD,NCHAN,NCLASS) 

Description  of  Parameters 

IPOP  -  Array  -  class/cluster  population  per  subimage 

FMEAN  -  Floating  point  array  -  cluster/class  means,  NCHAN 
values  per  cluster/class  stored  modulo  NCHAN 

FSD  -  Floating  point  array  -  cluster/class  standard 
deviations  stored  like  FMEAN. 

NCHAN  -  Integer  parameter  -  number  of  channels 

NCLASS  -  Integer  parameter  -  number  of  classes/clusters 

Method 

The  unnormalized  first  and  second  moments  are  input  and  the 
routine  calculates  the  mean  and  standard  deviation  and  stores 
the  results  back  in  the  input  arrays  (FMEAN, FSD). 

COMMON  Areas  Used 
None 
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3.5.4.16  Subroutine  SYMFAC 


Purpose 

Factor  a  given  symmetric  positive  definite  matrix 

Usage 

CALL  SYMFAC  (A,N,EPS,IER) 

Description  of  Parameters 

A  -  Upper  triangular  part  of  the  given  symmetric  positive 

definite  N  by  N  coefficient  matrix.  On  return  A 
contains  the  resultant  upper  traingular  matrix 

N  -  The  number  of  rows  (columns)  in  the  given  matrix. 

EPS  -  An  input  constant  which  is  used  as  relative  tolerance 

for  test  of  loss  of  significance 

IER  -  Resulting  error  parameter  coded  as  follows 
IER-0  -  No  error 

IER>=1  -  No  result  because  of  wrong  input  parameter  N 
or  because  some  radicand  is  nonpositive 
(matrix  A  is  not  positive  definite,  possibly 
due  to  loss  of  significance) 

IER-K  -  Warning  which  indicates  loss  of  significance. 

The  radicand  formed  at  factorization  step  K+l 
was  still  positive  but  no  longer  greater  than 
ABS  (EPS-A(K+1,  K+l)). 
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Remarks 

The  upper  triangular  part  of  the  given  matrix  is  assumed  to  be 
stored  columnwise  in  N*(N+l)/2  successive  storage  locations. 

In  the  same  storage  locations  the  resulting  upper  triangular 
matrix  is  stored  columnwise  too.  The  procedure  gives  results 
if  N  is  greater  than  0  and  all  calculated  radicands  are  positive. 
The  product  of  returned  diagonal  terms  is  equal  to  the  square 
root  of  the  determinant  of  the  given  matrix. 

Subroutines  and  Function  Subprograms  Required 
None 

Method 

Solution  is  done  using  the  square  root  method  of  Cholesky. 

The  given  matrix  is  represented  as  product  of  two  triangular 
matrices. 

COMMON  Areas  Used 
None 


3.5.4.17  Subroutine  SYMINV 

Purpose 

Invert  a  symmetric  positive  definite  matrix 


Usage 

CALL  SYMINV  ( A, N , EPS , DET , IER) 

Description  of  Parameters 

A  -  Upper  triangular  part  of  symmetric  positive  definite 

N  by  N  matrix.  On  return  contains  resultant  upper 
triangular  matrix. 
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N  -  The  number  of  rows  (columns)  in  given  matrix. 

DET  -  Determinant  of  matrix. 

EPS  -  An  input  constant  which  is  used  as  a  relative  tolerance 
for  test  of  loss  of  significance 

IER  -  Resulting  error  parameter  coded  as  follows 
IER*0  -  No  error 

IER*-1  -  No  result  because  of  wrong  input  parameter  N 
or  because  some  radicand  is  nonpositive 

IER-K  -  Warning  which  indicates  loss  of  significance. 
The  radicand  formed  by  factorization  step 
K+l  was  still  positive  but  no  longer  greater 
than  ABS(EPS*A(K+1,  K+l)). 

Remarks 

The  upper  triangular  part  of  the  given  matrix  is  assumed  to  be 
stored  columnwise  in  N*(N+l)/2  successive  storage  locations. 

In  the  same  storage  locations  the  resulting  upper  triangular 
matrix  is  stored. 

The  procedure  gives  results  if  N  is  greater  than  0  and  all 
calculated  radicands  are  positive. 

Subroutines  Used 
SYMFAC 

Method 


Solution  is  done  by  upper  triangular  factorization 


COMMON  Areas  Used 


None 

3.5.4.18  Subroutine  THRESD 

Purpose 

Creates  a  new  results  image  with  all  pixel  distance  metrics 
exceeding  the  threshold  assigned  to  the  null  class 

Usage 

CALL  THRESD  (NAMER.NCLASS .DTHRESS , IBUF1 , IBUF2 , IBUF3) 

Description  of  Parameters 

NAMER  -  4  word  array  -  name  of  results  image 

NCLASS  -  Integer  parameter  -  number  of  classes/clusters 

DTHES  -  Fit.  pt.  parameter  -  percent  of  pixels  thresholded 

IBUF1  -  Array  used  as  a  buffer  (5K  words) 

IBUF2  -  Array  used  as  a  buffer  (IK  words) 

IBUF3  -  Array  used  as  a  buffer  (IK  words) 


Method 

The  user  enters  the  percent  of  pixels  to  be  assigned  to  the 
null  class.  The  program  determines  the  cumulative  distribution 
of  the  distances  and  selects  the  appropriate  threshold.  Then 
each  pixel  with  a  distance  greater  than  the  threshold  is  assigned 
to  the  null  class.  A  new  results  image  is  generated  with  the 
new  assignments. 


COMMON  Areas  Used 


Section  4 
EXPERIMENT  RESULTS 


4. 1  OVERVIEW 

Classification  experiments  were  performed  on  three  multi-dimensional 
image  data  sets  and  these  experiments  constituted  the  testing  of  the 
feature  extraction  software  package.  They  had  two  purposes:  (1)  to 
verify  that  the  PMs  which  make  up  the  package  and  the  constituent 
functions  of  the  PMs  work  properly,  and  (2)  to  validate  the  accuracy 
of  the  versions  of  the  supervised  (maximum  likelihood)  and  unsuper¬ 
vised  clustering  algorithms  implemented  in  the  package.  The  three 
image  data  sets  were  a  LANDSAT  scene  containing  the  LACIE  intensive 
study  site  in  Hand  County,  South  Dakota,  a  synthetic  data  set  used 
in  the  thematic  mapper  design  parameters  investigation  (LANDSAT-D) , 
and  an  Indiana  scene  taken  with  the  ERIM  6-channel  aircraft  scanner. 
Because  ground  truth  was  not  available  for  the  Indiana  scene  the 
results  are  not  included  in  the  report. 


4.2  HAND  COUNTY  DATA  SET 


This  set  is  four-channel  Landsat  MSS  imagery  containing  a  5  x  6  nautical 
mile  LACIE  Intensive  site  in  a  LANDSAT  scene  of  100  x  100  nautical 
miles.  Each  of  the  four  bands  in  the  scene  was  transferred  to  disk  as 
an  image  file  and  a  composite  image  built  through  the  INTERL  PM.  This 
image,  2340  x  3240  pixels,  was  displayed  on  a  COMTAL,  and  by  successive 
use  of  the  image  expansion  facility  of  the  FIELDEF  PM,  a  512  x  512  pixel 
region  containing  the  5x6  nautical  mile  ground  truth  area  was  defined. 


It  should  be  appreciated  that  locating  such  a  small  area  in  a  large 
image  is  a  nontrivial  matter,  since  the  area  does  not  have  any  particu¬ 
larly  distinctive  features,  and  since  large  scale  (1:24,000)  maps  of 
the  region  of  interest  were  not  available  at  the  time  classification 
was  carried  out.  A  standard  road  map  in  which  the  Landsat  scene  could 
be  outlined  and  which  had  county  boundaries  was  available,  which,  it 
turned  out,  was  all  that  was  needed  in  conjunction  with  image  expansion. 
A  512  x  512  pixel  region  containing  the  ground  truth  area  in  its  in¬ 
terior  was  defined,  and  a  four-channel  composite  image  called  HANDCO 
was  built  using  INTERL  for  subsequent  field  definition,  signature 
computation,  maximum  likelihood  classification,  and  clustering. 

4.2.1  Supervised  Classification 

The  ground  truth  area  with  the  exception  of  about  2%  of  the  area  con¬ 
sists  of  spring  wheat,  oats,  com,  and  pasture,  the  proportions  of 
each  being  known  from  the  ground  inventory.  Using  this  inventory,  a 
sample  page  of  which  is  shown  in  Figure  4.2-1,  fields  in  the  four 
classes  were  selected  and  then  located  in  the  ground  truth  overlay. 
Figure  4.2-2.  With  the  aid  of  the  overlay,  some  seventeen  fields 
were  defined  in  the  SELECT  FIELDS  option  of  FIELDEF  including  the 
area  to  be  classified  (named  GRTRUTH)  which  contains  17,394  pixels, 
four  wheat  fields,  four  oat  fields,  three  com  fields,  and  three 
pasture/grass  fields.  These  are  shown  in  Figure  4.2-3.  The  same 
fields  except  for  GRTRUTH  were  then  designated  as  classes  (Figure 
4.2-4),  signatures  computed  (channel  means  and  covariance  matrices) 
and  five  classes  defined  for  classification  purposes  as  follows 
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Figure  4.2-3a  Hand  County  Training  Fields 
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Figure  4.2-3b  Hand  County  Training  Fields  (continued) 
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Figure  4.2-4a  Hand  County  Classes 
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Figure  4.2-4b  Hand  County  Classes  (continued) 
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Figure  4.2-4c  Hand  County  Classes  (continued) 


OATS 


OATS247,  OATS  192,  OATS226,  OATS38 


PASTURE  P76,  P274,  P105 

CORN  CORN 80,  CORN  100,  CORN54 


A  preliminary  classification  was  performed  by  the  MAXLIK  PM  yielding 
the  results  shown  in  Table  4.2-1,  where  the  percentages  of  the  prin¬ 
cipal  classes  derived  from  the  ground  truth  inventory  are  also  shown 
for  comparison. 


IMAGE 

AREAL 

PROPORTION 

ESTIMATES  (%) 

CORN 

OATS 

PASTURE 

SPRING  WHEAT 

WHEAT 

GROUND  TRUTH  MAP 

8.06 

6.47 

81.23 

4 

.24 

HANDCO 

11.04 

3.27 

59.98 

15.26 

10.46 

Table  4.2-1  Preliminary  Classification  Results 


While  this  preliminary  classification  demonstrated  that  the  PMs  are 
operating,  it  is  not  a  very  satisfactory  result.  It  is  clear  that 
the  two  wheat  classes  are  significantly  overclassified,  while  pasture 
is  underclassified.  Examination  of  the  results  indicated  that  pasture 
pixels  were  being  classified  as  wheat,  and  a  further  set  of 
Bhattacharyya  distances  indicated  that  class  W204  was  very  distinct 
from  SW236,  and  that  0ATS226  was  distinct  from  the  remaining  oat 
classes.  Accordingly  new  classes  were  defined  to  reflect  these 
observations  as  follows: 


I 


%  of  CLASSES 


SPRING  WHEAT 

SW236,  SW263 

6 

(4) 

CORN 

C0RN80 ,  CORN 110,  C0RN211 

18 

(8) 

OATS 

OATS38,  OATS 2 74 

10 

(6) 

PASTURE 

P76 ,  P274 

66 

(80) 

(Ideally,  these  classes  should  be  on  the  same  proportion  as  in  the 
total  inventory  (shown  in  parentheses) ,  but  it  is  not  always  possible 
to  identify  training  fields  of  the  appropriate  type  to  be  combined 
into  classes  of  the  desired  proportion) . 


A  second  classification  with  MAXLIK  yielded  the  results  of  Table  4.2-2. 
The  results  of  another  classification  using  the  ERMAN-II  system  are 
given  for  comparison. 


IMAGE 


AREAL  PROPORTION  ESTIMATES  (%) 


CORN 

OATS 

PASTURE 

SPRING  WHEAT 

n 

GROUND  TRUTH  MAP 

8.06 

6.47 

81.23 

4.24 

1* 

HANDCO 

11.04 

9.27 

61.10 

18.60 

I* 

ERMAN-II 

4ft  • 

CLASSIFICATION 

9.35 

8.70 

80.52 

1.42 

•• 

Table  4. 

2-2  Results 

of  Second 

Classification 

Run 

•  • 

An  examination  of 

the  results 

shows  that 

the  training  fields  OATS247, 

CORN80,  P76,  CORN110,  and  OATS38  each  is  classified  100%  in  its  own 
class,  but  that  5%  of  P274  is  classified  as  spring  wheat,  and  11%  of 
SW236  and  3%  of  SW263  is  classified  as  pasture.  What  is  happening  is 
that  a  small  percentage  of  pasture  is  being  classified  as  spring  wheat, 
and  a  small  percentage  of  wheat  is  being  classified  as  pasture.  Since 
the  ground  truth  area  is  80%  pasture,  the  number  of  pasture  pixels 
classified  as  spring  wheat  is  far  greater  than  the  number  of  spring 
wheat  pixels  as  classified  as  pasture. 
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The  important  point  is  that  these  effects  are  inherent  in  the  data,  not 
in  the  algorithm  or  its  implementation.  The  Hand  County  classification 
exercise  confirms  that  the  supervised  classification  portion  of  the 
feature  extraction  package  is  functioning  properly. 


4.2.2  Unsupervised  Classification 

The  CLUSTER  PM  was  exercised  with  the  Hand  County  data  in  two  modes. 

In  the  first,  the  four  classes  employed  in  the  MAXLIK  exercise  were 
taken  as  starting  vectors  (cluster  centers),  while  in  the  second,  self¬ 
generated  (zero)  starting  vectors  were  used.  The  results  of  the  second 
mode  will  be  described  here  since  it  is  representative  or  the  most  com¬ 
mon  use  of  unsupervised  classification,  that  is  in  situations  where 
there  is  no  prior  information.  A  split /combine  sequence  of  SSSCS  was 
chosen,  and  T1  and  T2  set  to  2.0  and  4.0  respectively,  the  remaining 
clustering  parameters  being  at  their  default  values.  The  split/combine 
sequence  chosen  yielded  seven  clusters  plus  the  null  cluster  (to  which 
only  0.4%  of  the  pixels  were  assigned).  A  convenient  way  of  summarizing 
and  evaluating  the  results  is  to  determine  the  clusters  into  which 
pixels  of  each  of  the  four  classes  were  assigned.  This  is  done  in  Table 
4.2-3  (certain  clusters  with  a  small  percentage  of  pixel  assignment  were 


CLASS 

CLUSTEP 

%  OF  CLASS 

OATS 

1 

51 

8 

48 

CORN 

3 

6 

6 

90 

SPRING  WHEAT 

3 

29 

4 

71 

PASTURE 

2 

9 

3 

3 

4 

65 

7 

20 

Table  4.2-3  Assignment  of  Classes  to  Clusters 


ignored  in  this  summary) .  To  determine  the  reasonableness  of  this 
result,  the  channel  means  of  Cluster  1  may  be  compared  with  a  training 


field  known  to  be 

oats,  cluster  6  with 

corn,  and  cluster 

4  with  spring 

wheat  and  pasture. 

This  done  on  Table 

4.2-4.  When  it  is 

recalled  that 

CLUSTER  1 

OATS 24 7 

CLUSTER  6 

CORN 54 

24.53 

24.78 

17.64 

16.89 

32.37 

30.04 

18.39 

17.25 

39.13 

39.24 

29.40 

28.67 

30.72 

30.78 

21.85 

21.64 

CLUSTER  4 

SW263 

P105 

20.40 

19.97 

20.27 

23.12 

24.80 

22.56 

33.24 

31.73 

32.25 

25.53 

24.17 

25.34 

Table  4.2-4  Cluster  and  Training  Field  Means 

the  clusters,  and  therefore,  their  means,  were  generated  by  a  process 
which  is  completely  independent  of  the  selection  of  training  fields, 
the  agreement  between  supervised  and  unsupervised  classification  must 
be  considered  quite  good.  Note  that  the  confusion  between  spring  wheat 
and  pasture  is  corroborated. 

In  an  actual  application,  further  effort  would  be  made  to  locate  fur¬ 
ther  spring  wheat  fields  and  perhaps  pastures  in  order  to  improve  the 
separation  of  the  two  classes.  However,  from  the  point  of  view  of  testing 
the  feature  extraction  package,  the  results  obtained  confirm  its  satis¬ 
factory  operation. 
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4.3  SYNTHETIC  DATA  SET 


The  synthetic  data  used  in  the  second  series  of  tests  of  the  feature 
extraction  package  was  derived  from  a  set  consisting  of  six  bands  each 
of  1,650  lines  and  442  samples  which  provided  field  sizes  of  2.5,  5,  10, 
and  20  acres,  with  form  factors  (width  to  length  ratio)  of  1  x  1,  1  x  2, 
and  1x4.  Spectral  signatures  of  six  classes,  namely  trees,  com,  pas¬ 
ture,  winter  wheat,  soybeans,  and  soil  were  derived  from  covariance 
matrices  obtained  from  aircraft  data  of  an  actual  region  comprising 
those  classes.  This  set  was  resampled  to  simulate  satellite  multi- 
spectral  imagery,  yielding  a  data  set  of  154  lines  and  54  pixels  in 
bands  1  through  5  and  38  lines  and  13  pixels  in  band  6  to  simulate  an 
infra-red  sensor.  Since  band  6  has  too  few  pixels  for  adequate  field 
definition,  only  bands  1  through  5  were  used.  After  transference  to 
disk,  these  were  combined  in  the  INTER!  PM  to  build  a  composite  image 
called  SYNDATA. 

4.3.1  Supervised  Classification 

Although  detailed  "ground  truth"  data  is  available  for  the  original 
synthetic  data  set,  time  limitations  prevented  its  use  in  defining 
training  fields,  which  would  have  involved  long  and  tedious  identifi¬ 
cation  of  pixel  coordinates.  Instead  five  training  fields  simply 
designated  FIELD1  through  FIELD5  were  identified  visually  and  designated 
as  classes,  along  with  a  class  corresponding  to  the  wedge  shaped  borders 
along  the  left  and  right  hand  vertical  edges  of  the  image  designated 
BACKGROUND. 


Supervised  classification  was  carried  out  with  MAXLIK.  Individual  field 
results  showed  that  100%  of  the  pixels  in  each  of  the  six  training 
fields  were  classified  into  their  respective  classes  (all  of  the  56 


pixels  in  field  FI  were  assigned  to  class  FIELD1,  all  of  the  54  pixels 
in  field  FIELD2  were  assigned  to  class  FIELD2,  etc.)  thus  justifying 
the  training  field  definition. 

Thresholding  was  applied  to  the  resulting  class  map  at  the  30%  level, 
which  resulted  in  the  assignment  of  29.94%  of  these  pixels  to  the  null 
class.  The  resulting  class  map  shows  very  clearly  that  the  null  class 
is  primarily  the  boundary  line  (simulating  roads  or  bare  soil)  between 
the  fields.  Polaroid  photographs  of  the  COMTAL  displays  of  the  origi¬ 
nal  data  set  and  the  threshold  data  set  are  shown  in  Figure  4.3-1.  In 
spite  of  the  small  size  of  the  image  because  the  image  expansion  facil¬ 
ity  is  not  yet  available  for  output  images  (class  and  cluster  maps), 
the  good  quality  of  the  classification  is  obvious. 

The  synthetic  data  set  results  confirm  the  satisfactory  operation  of 
the  INTERL,  FIELDEF,  CLASTAT,  and  MAXLIK  PMs. 


4.3.2  Unsupervised  Classification 

Unsupervised  classification  was  carried  out  on  the  composite  image 
SYNDATA  through  the  PM  CLUSTER.  Initial  clustering  parameters  were 
chosen  as  shown  in  Figure  4.3-2,  and  the  zero  initial  starting  vector 
selected.  While  an  iteration-by-iteration  report  of  the  development  of 
clusters  will  not  be  given  here,  it  is  interesting  to  note  that  the 
cluster  corresponding  to  the  bright  border  pixels  (corresponding  to  the 
class  BACKGROUND  in  the  supervised  classification)  begins  to  appear 
after  the  first  split  (as  CLUSTER  1,  Figure  4.3-3a),  and  is  well-defined 
after  the  second  split  (as  CLUSTER  1,  4. 3- 3b). 
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Figure  4.3-2  Initial  Clustering  Parameters  for  Synthetic  Data  Set 
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TO  RESUME  COMPUTATION  ENTER  (CR) 


Cluster  field  results  show  the  assignment  of  training  field  pixels  to 
clusters,  summarized  in  Table  4.3-5.  (as  before  certain  small  percent¬ 
age  pixel  assignments  are  not  included  in  the  summary) . 


I 


I 

I 


FIELD  NAME 

CLUSTER  NUMBER 

%  PIXELS  ASSIGNED 

FI 

6 

100 

FIELD 2 

3 

76 

12 

24 

FIELD3 

10 

72 

5 

23 

12 

4 

FIELD4 

2 

98 

FIELD5 

3 

90 

12 

10 

BACKGROUND 

1 

100 

Table  4.3-5  Assignment  of  Training  Field  Pixels* 


It  is  apparent  that  a  direct  identification  can  be  made  between  FI  and 
CLUSTER  6,  FIELD4,  and  CLUSTER  2,  FIELD5  and  CLUSTER  3,  and  BACKGROUND 
nad  CLUSTER  1,  but  CLUSTERS  5  and  10  are  confounded.  Statistics  of 
these  clusters  are  given  in  Table  4.3-6  for  comparison. 


CLUSTER  5 

CLUSTER  10 

MEAN 

STD.  DEV. 

MEAN 

STD.  DEV. 

148.09 

3.06 

147.83 

3.06 

120.44 

1.62 

120.19 

1.33 

95.66 

1.58 

95.09 

1.79 

46.55 

2.89 

46.12 

2.44 

66.14 

4.56 

58.75 

2.33 

Table  4.3-6  Cluster 

Statistics 

Comparison 

Indeed  these  two  clusters  are  quite  close,  and  perhaps  would  have  merged 
if  another  combine  iteration  had  been  specified.  Since  eleven  clusters 
were  generated,  CLUSTER  12  is  the  null  cluster  which,  it  turns  out, 


4-21 


corresponds  to  the  field  boundaries  or  "roads"  between  fields.  The 
only  surprising  result  is  the  apparent  confusion  between  FIELD2  and 
FIELD5  which  are  mostly  assigned  to  the  same  cluster,  CLUSTER  3. 
Statistics  for  these  fields  are  shown  in  Table  4.3-7  for  comparison. 


FIELD2 

MEAN 

79.40 

65.59 

36.43 

113.74 

22.93 


D  DEV. 

MEAN 

2.83 

82.56 

0.79 

77.15 

1.19 

43.97 

1.08 

114.03 

2.39 

26.10 

FIELD5 

STD.  DEV 
3.08 
1.16 
1.61 
1.44 
3.29 

Comparison 


Table  4.3-7  Field  Statistics 


There  is  a  considerable  degree  of  similarity  between  the  two  training 
fields,  and  it  would  be  interesting  to  pursue  the  matter  further. 

Figure  4.3-4  has  a  polaroid  photograph  of  the  cluster  map,  which  is  not 
as  vivid  as  it  might  be  because  of  the  particular  color  selection  used 
to  denote  clusters.  However,  the  excellent  agreement  between  cluster 
map  and  original  image  can  be  seen  by  comparing  Figure  4.3-4  with  Figure 
4.3-la.  As  a  final  comparison,  pixel  by  pixel  subimage  listings  are 
shown  in  Figure  4.3-5  for  supervised  and  unsupervised  classifications. 

It  is  concluded  that  the  principal  components  of  the  feature  extraction 
package  are  working  properly,  and  that  the  package  is  ready  for  applica¬ 
tion  to  actual  classification  problems. 
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Figure  4.3-5a  Cluster  Map  Results,  Synthetic  Data 
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Figure  4.3-5b  Classification  Map  Results,  Synthetic  Data 


Section  5 

EXPERIMENT  CONCLUSIONS  AND  RECOMMENDATIONS 


5.1  CONCLUSIONS 

During  the  period  of  performance  of  the  FEATURE  EXTRACTION  EXPERIMENT 
the  following  conclusions  evolved. 

a.  Extensive  testing  validated  the  techniques  and  software 
implementation  of  the  FEATURE  EXTRACTION  SYSTEM.  Tests  were 
run  using 

o  5-channel  Synthetic  Data  Set 

o  ERIM  6-channel  aircraft  scanner  data 

o  LANDSAT  4-channel  scenes  of  Hand  County,  South  Dakota 
and  the  results  were  analyzed  and  compared  to  ground  truth 
when  it  was  available.  The  results  compared  favorably  to 
the  ground  truth.  Further,  supervised  and  unsupervised 
classification  of  the  same  data  sets  were  in  substantial 
agreement  with  each  other. 

b.  Execution  time  of  maximum  likelihood  classification  on  the 
CDC  6400  for  a  full  LANDSAT  scene  of  3240  pixels  x  2340 
lines  and  4  channels  with  10  classes  takes  approximately 
378  minutes.  This  is  probably  too  long  a  period  of  time 
for  an  interactive  system.  However,  smaller  scenes  take 
proportionally  less  time,  the  LACIE  sets  requiring  less 
than  a  minute. 

c.  The  developed  system  has  a  broad  range  of  applications  and 
is  best  used  for  the  purpose  of  testing  and  evaluating 


processing  techniques  and  hypotheses  about  remotely  sensed 
data. 

d.  More  robust  distance  metrics  are  needed  to  help  analyze  the 
separation  (probability  of  misclassif ication)  between 
classes . 

e.  More  feedback  and  correlation  is  needed  between  the  super¬ 
vised  and  unsupervised  classification  (MAXLIK  and  CLUSTER) 
processing  modules. 

5 . 2  RECOMMENDAT IONS 

To  improve  the  performance  of  the  package,  the  following  steps  should 
be  taken. 

a.  Both  CLUSTER  and  MAXLIK  should  be  implemented  on  the  STARAN 
to  reduce  computation  time. 

b.  Processing  modules  to  reduce  the  dimensionality  of  the  data 
should  be  developed.  An  example  is  the  principal  components 
technique  which  uses  the  Hotelling  transformation. 

c.  Implementation  of  the  Probability  of  Error  and  Chemoff 
Distance  is  desirable  in  order  to  enable  the  user  to  better 
decide  whether  his  selection  of  classes  is  adequate,  and 
when  to  merge  classes. 
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